INVERSE THEOREMS FOR SETS AND MEASURES OF
POLYNOMIAL GROWTH

TERENCE TAO 


Abstract. We give a structural description of the finite subsets $A$ of an arbitrary
group $G$ which obey the polynomial growth condition $|A^n| \le n^d |A|$ for some bounded $d$
and sufficiently large $n$, showing that such sets are controlled by (a bounded number of
translates of) a coset nilprogression in a certain precise sense. This description recovers
some previous results of Breuillard-Green-Tao and Breuillard-Tointon concerning sets
of polynomial growth; we are also able to describe the subsequent growth of $|A^m|$ fairly
explicitly for $m \ge n$, at least when $A$ is a symmetric neighbourhood of the identity.
We also obtain an analogous description of symmetric probability measures $\mu$ whose
$n$-fold convolutions $\mu^{*n}$ obey the condition
$\|\mu^{*n}\|_{\ell^2}^{-2} \le n^d \|\mu\|_{\ell^2}^{-2}$. In the abelian case, this
description recovers the inverse Littlewood-Offord theorem of Nguyen-Vu, and gives
a “symmetrised” variant of a recent nonabelian inverse Littlewood-Offord theorem of
Tiep-Vu.

Our main tool to establish these results is the inverse theorem of Breuillard, Green,
and the author that describes the structure of approximate groups.


1. Introduction 

In the field of arithmetic combinatorics, it has turned out to be particularly fruitful 
to establish inverse theorems in which some combinatorial hypothesis on an object of 
arithmetic combinatorial nature (e.g. a finite subset of a group) is used to establish a 
much more explicit description of that object. Ideally, these inverse theorems should be 
matched as closely as possible by a converse direct theorem that shows that all objects 
of the given explicit description obey the original combinatorial hypothesis (perhaps 
with some loss in the quantitative constants). A typical instance of this is Freiman’s 
inverse sumset theorem, first established over the integers by Freiman m, and in the 
general setting of abelian groups by Green and Ruzsa m 

Theorem 1.1 (Freiman’s theorem). Let $G = (G, +)$ be an arbitrary abelian group, and
let $A$ be a finite non-empty subset of $G$ of cardinality $|A|$. Suppose that the sumset
$2A = A + A := \{a_1 + a_2 : a_1, a_2 \in A\}$ is such that $|A + A| \le K|A|$ for some $K \ge 1$.
Then there exists a coset progression $H + P$, where $H$ is a finite subgroup of $G$ and
$P = P(v_1,\dots,v_r; N_1,\dots,N_r)$ is a generalised arithmetic progression, that is to say a
set of the form

$$P = \{n_1 v_1 + \dots + n_r v_r : n_1,\dots,n_r \in \mathbb{Z};\ |n_i| \le N_i\ \forall 1 \le i \le r\}$$

for some rank $r \ge 0$, some $N_1,\dots,N_r > 0$ and $v_1,\dots,v_r \in G$, such that $A \subset H + P$,
$|H + P| \le C_K |A|$, and $r \le C'_K$, where $C_K, C'_K$ are quantities depending only on $K$.
More explicit values for $C_K$ were given in HU; for the best known bounds on these
quantities see the survey [23] of Sanders. However, we will not be concerned with
quantitative values of constants in this paper. The corresponding direct theorem is
easy: if $H + P$ is a coset progression of some rank $r$ and $A$ is a subset of $H + P$ with
$|A| \ge \varepsilon |H + P|$, then it is easy to see that $|A + A| \le C_{r,\varepsilon} |A|$ for some
quantity $C_{r,\varepsilon}$ depending only on $r, \varepsilon$.

A closely related variant of Freiman’s theorem concerns iterated sumsets
$nA = A + \dots + A$ of $A$ for large $n$, in the case that $nA$ enjoys relative polynomial
growth in $n$:

Theorem 1.2 (Polynomial growth inverse theorem). Let $d \ge 1$, and let $n$ be sufficiently
large depending on $d$ (thus $n \ge n_0(d)$ for some $n_0(d)$ depending on $d$). Let $A$ be a finite
non-empty subset of an abelian group $G$ such that $|nA| \le n^d |A|$. Then there exists a
coset progression $H + P$ of rank at most $C_d$ such that $A \subset H + P$ and $|H + P| \le C'_d |A|$,
where $C_d$ and $C'_d$ depend only on $d$.


See for instance m Theorem 2.7] for a quantitative version of this theorem with quite
good values for the constants $n_0(d), C_d, C'_d$. Again, the corresponding direct theorem is
easy to establish: if $H + P$ has rank at most $r$ and $A \subset H + P$ with $|A| \ge \varepsilon |H + P|$,
then $|nA| \le C_{r,\varepsilon} n^r |A|$ for some $C_{r,\varepsilon}$ depending only on $r, \varepsilon$.
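The direct theorem can be checked numerically in a toy case. The following is a minimal sketch, assuming the simplest setting $G = \mathbb{Z}$, $H$ trivial and $A = P$ a rank-2 progression with well-separated generators; the helper names (`progression`, `sumset`, `iterated_sumset_sizes`) and the specific generators are illustrative choices, not notation from the paper.

```python
from itertools import product

def progression(gens, bounds):
    """Generalised arithmetic progression {n_1 v_1 + ... + n_r v_r : |n_i| <= N_i} in Z."""
    ranges = [range(-N, N + 1) for N in bounds]
    return {sum(n * v for n, v in zip(ns, gens)) for ns in product(*ranges)}

def sumset(A, B):
    """The sumset A + B = {a + b : a in A, b in B}."""
    return {a + b for a in A for b in B}

def iterated_sumset_sizes(A, n_max):
    """Return the list [|1A|, |2A|, ..., |n_max A|]."""
    sizes, current = [], set(A)
    for _ in range(n_max):
        sizes.append(len(current))
        current = sumset(current, A)
    return sizes

# Rank r = 2 progression in Z; the generators 1 and 1000 are far enough apart
# that no collisions occur for the small multiples considered here.
P = progression(gens=(1, 1000), bounds=(3, 3))
sizes = iterated_sumset_sizes(P, 6)

# |nP| / (n^r |P|) should stay bounded, in line with |nA| <= C n^r |A|.
ratios = [s / (n ** 2 * len(P)) for n, s in enumerate(sizes, start=1)]
```

Here $nP$ is again a box of side lengths $6n + 1$, so the ratio $|nP| / (n^2 |P|)$ never exceeds $1$ in this example.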

Another family of inverse theorems in arithmetic combinatorics are the inverse
Littlewood-Offord theorems that classify the tuples of elements $v_1,\dots,v_n$ of an additive
group for which the random sum $\pm v_1 \pm \dots \pm v_n$ exhibits unusually large concentration;
see [2T] for a survey. A near-optimal such theorem in the case of torsion-free abelian groups is
the following result of Nguyen and Vu m Theorem 2.5]:

Theorem 1.3 (Inverse Littlewood-Offord theorem). Let $v_1,\dots,v_n$ be elements of a
torsion-free abelian group $G = (G, +)$, and define the concentration probability

$$p := \sup_{x \in G} \mathbf{P}(\xi_1 v_1 + \dots + \xi_n v_n = x)$$

where $\xi_1,\dots,\xi_n \in \{-1,+1\}$ are independent Bernoulli variables, each of which attains
either sign $+1,-1$ with equal probability. Suppose that $p \ge n^{-A}$ for some $A > 0$. Let
$n' \in [n^\varepsilon, n]$ for some $\varepsilon > 0$. Then there exists an arithmetic progression $P$ of rank $r$ at
most $C_{\varepsilon,A}$ that contains all but at most $n'$ of the $v_1,\dots,v_n$, such that

$$|P| \le C_{\varepsilon,A}\, p^{-1} (n')^{-r/2},$$

where $C_{\varepsilon,A}$ depends only on $\varepsilon$ and $A$.
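To make the concentration probability $p$ concrete, here is a small sketch that computes it exactly for integer steps by dynamic programming over partial sums. The function name `concentration_probability` and the two test tuples are illustrative assumptions; the sketch only covers $G = \mathbb{Z}$.

```python
from collections import Counter
from fractions import Fraction

def concentration_probability(vs):
    """Exact sup_x P(xi_1 v_1 + ... + xi_n v_n = x) for integer steps vs
    and independent uniform signs xi_i in {-1, +1}."""
    dist = Counter({0: 1})  # number of sign patterns reaching each partial sum
    for v in vs:
        nxt = Counter()
        for x, count in dist.items():
            nxt[x + v] += count
            nxt[x - v] += count
        dist = nxt
    return Fraction(max(dist.values()), 2 ** len(vs))

n = 12
# All steps equal: they lie in a rank-one progression, and p is of order n^{-1/2}.
p_equal = concentration_probability([1] * n)
# Steps 1, 2, 4, ...: all 2^n signed sums are distinct, so p = 2^{-n}.
p_spread = concentration_probability([2 ** i for i in range(n)])
```

The contrast between the two tuples illustrates the phenomenon the theorem quantifies: polynomially large concentration forces most of the steps into a progression of small rank and size.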

Setting n' to be comparable to n and ignoring the exceptional elements that lie outside 
of P, this result can be matched by a corresponding direct theorem; see [2D, Example 
1.7]. Inverse Littlewood-Offord theorems have a number of applications, particularly to 
the theory of discrete random matrices; see I2H for details. 

1 For the sharpest quantitative results, it is preferable to replace the notion of a coset progression 
with the slightly more general notion of a convex coset progression , and to cover A by a bounded 
number of translates of H + P, rather than just by H + P; see [23] for more details. 
In this paper we will be concerned with analogues of these inverse theorems in the
context of non-abelian groups $G = (G, \cdot)$. To reflect this we now switch to multiplicative
notation rather than additive notation, for instance using product sets
$A \cdot B := \{ab : a \in A, b \in B\}$, iterated product sets $A^n = A \cdot \ldots \cdot A$, etc.

To state the non-abelian version of Freiman’s theorem, we need to generalise the notion
of a coset progression, recalling some notation from [6]. Define a nilprogression
$P = P(v_1,\dots,v_r; N_1,\dots,N_r)$, where $v_1,\dots,v_r$ are elements of a group $G$ and
$N_1,\dots,N_r > 0$ are real numbers, to be the set of all evaluations of words formed from
$v_1,\dots,v_r, v_1^{-1},\dots,v_r^{-1}$, where for each $i = 1,\dots,r$, the total number of times $v_i$ and
$v_i^{-1}$ are used in the word is at most $N_i$, and such that $v_1,\dots,v_r$ generate a nilpotent group of
some nilpotency class $s$, which we refer to as the nilpotency class of the nilprogression;
the quantity $r$ is the rank of a nilprogression. Next, we define a coset nilprogression
to be a set of the form $HP$, where $H$ is a finite group, and $P$ is a subset of the
normaliser $N(H)$ of $H$ which becomes a nilprogression upon applying the quotient map
from $N(H)$ to $N(H)/H$, that is to say the coset nilprogression $HP$ is the pullback of a
nilprogression in $N(H)/H$. Note that a coset progression is nothing more than a coset
nilprogression in an abelian group. We define the rank and nilpotency class of a coset
nilprogression to be the rank and nilpotency class of the associated nilprogression.

It turns out to be technically convenient to restrict to a well behaved class of
nilprogressions and coset nilprogressions. Following [6], we call a nilprogression
$P = P(u_1,\dots,u_r; N_1,\dots,N_r)$ in $C$-normal form for some $C \ge 1$ if the following hold:


(i) (Upper-triangular form) For every $1 \le i < j \le r$ and for all four choices of signs
$\pm$ one has

$$[u_i^{\pm 1}, u_j^{\pm 1}] \in P\left(u_{j+1},\dots,u_r; \frac{C N_{j+1}}{N_i N_j},\dots,\frac{C N_r}{N_i N_j}\right).$$

Here and in the sequel we use the convention $[g, h] := g^{-1} h^{-1} g h$.

(ii) (Local properness) The expressions $u_1^{n_1} \dots u_r^{n_r}$ are distinct as $n_1,\dots,n_r$ range
over the integers with $|n_i| \le \frac{1}{C} N_i$ for $i = 1,\dots,r$.

(iii) (Volume bound) One has

$$\frac{1}{C} \prod_{i=1}^r (2\lfloor N_i \rfloor + 1) \le |P| \le C \prod_{i=1}^r (2\lfloor N_i \rfloor + 1).$$


We say that a coset nilprogression is in $C$-normal form if its associated nilprogression is
in $C$-normal form. Note that for a nilprogression (or coset nilprogression) in $C$-normal
form, the nilpotency class is bounded by the rank $r$.

2 A pedantic point: technically, a nilprogression is not just the set $P$, but is instead the tuple
$(P, r, G, (v_1,\dots,v_r), (N_1,\dots,N_r))$, thus it is possible for distinct nilprogressions to generate the same
set $P$ by selecting different choices of $v_1,\dots,v_r$ or $N_1,\dots,N_r$. Similarly for the concept of a coset
nilprogression. This distinction is needed later when we define the dilations $P^t$, $HP^t$ of a nilprogression
$P$ or coset nilprogression $HP$. However, in the paper we will frequently abuse notation and just refer
to the set $P$ (or $HP$) as the nilprogression (or coset nilprogression).

3 One could also work instead with the closely related nilcomplete progressions and nilpotent
progressions studied in EB3, m-
Next, following [25], we define a $K$-approximate group for some $K \ge 1$ to be a subset
$A$ of a group $G$ which is symmetric (thus $A^{-1} := \{a^{-1} : a \in A\}$ is equal to $A$), contains
the identity $1$, and is such that $A^2$ can be covered by at most $K$ left-translates of $A$.
We then have the following result from [6, Theorem 2.10], building upon many previous
results (see [7] for a survey):

Theorem 1.4 (Inverse theorem for $K$-approximate groups). Let $A$ be a finite
$K$-approximate group in an arbitrary group $G$. Then there exists a coset nilprogression $HP$
of rank and nilpotency class at most $C_K$ in $C_K$-normal form, such that $|HP| \le C_K |A|$
and that $A$ is covered by at most $C_K$ left-translates of $HP$. Here $C_K$ is a quantity
depending only on $K$.

The bounds on the rank and nilpotency class can be made quite effective; see [6, Theorem
2.12]. However, the bound on the size of $HP$, the normal form, and the covering
number are currently ineffective, due to the reliance in [6] on results related to Hilbert’s
fifth problem. As we will be relying heavily on Theorem 1.4 in this paper, the bounds
in our results are similarly ineffective. As far as the corresponding direct theorem is
concerned, one can check that a coset nilprogression $HP$ of rank $r$ and nilpotency class
$s$ in $C$-normal form is a $C_{r,s,C}$-approximate group for some $C_{r,s,C}$ depending on $r, s, C$;
see [6, Lemma C.1, Remark C.2] or [9]. This direct theorem is not a full converse to the
above inverse theorem, since one has to also consider the situation where $A$ is covered by
a bounded number of left-translates of $HP$ rather than being equal to $HP$, but shows
that the description of approximate groups given in Theorem 1.4 is somewhat close to
optimal.

Now we turn to nonabelian analogues of Theorem 1.2. We first recall two results in the
literature, which were proven as consequences of Theorem 1.4:

Theorem 1.5. Let $A$ be a symmetric finite subset of a group $G$ containing the identity,
let $d > 0$, and suppose that $|A^n| \le n^d |A|$ for some $n$ that is sufficiently large depending
on $d$ (thus $n \ge n_0(d)$ for some sufficiently large $n_0(d)$ depending only on $d$).

(i) [6, Theorem 1.13] The group $\langle A \rangle$ generated by $A$ is virtually nilpotent. Indeed,
$\langle A \rangle$ has a subgroup $G'$ of index at most $C_d$, which contains a normal finite
subgroup $H$ such that $G'/H$ is nilpotent of rank and nilpotency class at most $C'_d$,
where $C_d, C'_d$ depend only on $d$.

(ii) [9, Theorem 1.1] For any natural numbers $m \ge n$ and $k \ge 1$, we have
$|A^{km}| \le C_d^k |A^m|$, where $C_d$ depends only on $d$.

These results give a significant amount of control on $A$; for instance, the first claim
(i) easily implies Gromov’s theorem na that finitely generated groups of polynomial
growth are virtually nilpotent. However, these statements are not complemented by a
matching direct theorem; the conclusions (i), (ii) do not obviously imply a bound like
$|A^n| \le n^d |A|$.


1.1. New results. Our first main result gives an inverse theorem that comes with a
matching direct theorem. To state it, we need some additional definitions. Given a
nilprogression $P = P(v_1,\dots,v_r; N_1,\dots,N_r)$ in a group $G$ and a parameter $t > 0$, we
define the dilation $P^t$ by the formula

$$P^t := P(v_1,\dots,v_r; tN_1,\dots,tN_r).$$

Note that this agrees with the existing definition of $P^t$ as an iterated product when $t$ is
a natural number. Given a group element $g \in G$, we define the “norm” $\|g\|_P \in [0, +\infty]$
by the formula

$$\|g\|_P := \inf\{t \in [0, +\infty] : g \in P^t\}.$$

It is easy to see that $\|g\|_P = 0$ if and only if $g = 1$, that $\|g^{-1}\|_P = \|g\|_P$, and
$\|gh\|_P \le \|g\|_P + \|h\|_P$ for all $g, h \in G$, thus justifying the denotation of $\|\cdot\|_P$ as a
“norm”. One can think of the nilprogression $P$ as being analogous to a symmetric convex body in a
vector space, in which case $\|\cdot\|_P$ is analogous to the norm generated by that body as a
unit ball. Informally, elements $g \in G$ which are small in $\|\cdot\|_P$ norm will almost preserve
$P$ by left multiplication: $gP \approx P$.

Similarly, given a coset nilprogression $HP$ in a group $G$ and a parameter $t > 0$, we
define the coset nilprogression $HP^t$ to be the pullback of the nilprogression $(HP/H)^t$
under the quotient map from $N(H)$ to $N(H)/H$, and then define

$$\|g\|_{HP} := \inf\{t \in [0, +\infty] : g \in HP^t\}.$$

Again we have $\|g^{-1}\|_{HP} = \|g\|_{HP}$ and $\|gh\|_{HP} \le \|g\|_{HP} + \|h\|_{HP}$ for all $g, h \in G$;
furthermore one has $\|g\|_{HP} = 0$ if and only if $g \in H$. Thus one can view $\|\cdot\|_{HP}$ as a
“seminorm”. As before, elements $g \in G$ which are small in $\|\cdot\|_{HP}$ seminorm will
(informally speaking) almost preserve $HP$: $gHP \approx HP$.

Finally, we need a “virtual” extension of the “seminorm” $\|\cdot\|_{HP}$. Given a finite
non-empty set $X \subset G$ and a group element $g \in G$, we define the “seminorm” $\|g\|_{HP,X}$ by the
formula

$$\|g\|_{HP,X} := \inf_{\sigma \in \mathrm{Sym}(X)} \sup_{x \in X} \|\sigma(x)^{-1} g x\|_{HP}, \qquad (1.1)$$

where $\mathrm{Sym}(X)$ denotes the group of permutations $\sigma : X \to X$ on the finite set $X$. Again,
we have $\|g^{-1}\|_{HP,X} = \|g\|_{HP,X}$ and $\|gh\|_{HP,X} \le \|g\|_{HP,X} + \|h\|_{HP,X}$. If $\|g\|_{HP,X} = 0$ then
$gXHP = XHP$; more generally, one should think of elements $g$ that are small in $\|\cdot\|_{HP,X}$
seminorm to approximately preserve $XHP$, thus $gXHP \approx XHP$.

Example 1.6. Consider the dihedral group $\{-1,+1\} \ltimes \mathbb{Z}$ of maps $x \mapsto ax + b$ with
$a \in \{-1,+1\}$ and $b \in \mathbb{Z}$. The set of translations $P = \{x \mapsto x + n : |n| \le N\}$ is a
nilprogression, and hence also a coset nilprogression $HP$ if we take $H$ to be trivial. If
we let $X$ consist of the identity map $x \mapsto x$ and the reflection $x \mapsto -x$, then a map $g$ of
the form $x \mapsto ax + b$ with $a \in \{-1,+1\}$ and $b \in \mathbb{Z}$ will have seminorm $\|g\|_{HP,X} = |b|/N$.
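The seminorm in this example can be verified by brute force from formula (1.1). The sketch below is an illustration only: it encodes the map $x \mapsto ax + b$ as the pair `(a, b)`, takes $H$ trivial, and uses hypothetical helper names (`compose`, `inverse`, `norm_P`, `seminorm`).

```python
from fractions import Fraction
from itertools import permutations
from math import inf

def compose(g, h):
    """Composition g o h of affine maps (a, b): x -> a*x + b (apply h first, then g)."""
    return (g[0] * h[0], g[0] * h[1] + g[1])

def inverse(g):
    """Inverse of x -> a*x + b when a is +1 or -1, namely x -> a*x - a*b."""
    a, b = g
    return (a, -a * b)

def norm_P(g, N):
    """||g||_P for P = {x -> x + n : |n| <= N}; infinite unless g is a translation."""
    a, b = g
    return Fraction(abs(b), N) if a == 1 else inf

def seminorm(g, X, N):
    """Formula (1.1) with H trivial: inf over permutations sigma of X
    of sup over x in X of ||sigma(x)^{-1} g x||_P."""
    best = inf
    for sigma in permutations(X):
        worst = max(norm_P(compose(inverse(s), compose(g, x)), N)
                    for s, x in zip(sigma, X))
        best = min(best, worst)
    return best

X = [(1, 0), (-1, 0)]  # the identity map and the reflection x -> -x
N = 10
value = seminorm((-1, 7), X, N)  # a reflection composed with a shift by 7
```

For a reflection-type element the identity permutation gives an infinite value, and it is the permutation swapping the two elements of $X$ that realises $|b|/N$; this is the role of the infimum over $\mathrm{Sym}(X)$.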


We can now state our first main theorem, which we establish in Sections 2 and 3.

Theorem 1.7 (Inverse theorem for polynomial growth). Let $A$ be a finite non-empty
subset of a group $G$, let $d > 0$, and suppose that $|A^n| \le n^d |A|$ for some $n$ that is
sufficiently large depending on $d$. Then there exists a coset nilprogression $HP$ of rank
and nilpotency class at most $C_d$ in $C_d$-normal form, and a finite subset $X$ of $G$ of
cardinality at most $C_d$ containing the identity, such that

$$HP \subset (A \cup \{1\} \cup A^{-1})^{C_d n} \qquad (1.2)$$

and such that

$$A \cup \{1\} \cup A^{-1} \subset \left\{ g \in G : \|g\|_{HP,X} \le \frac{C_d}{n} \right\}, \qquad (1.3)$$

where $C_d$ is a natural number depending only on $d$.

Example 1.8. Continuing Example 1.6, if we take $A$ to be the set of maps $x \mapsto ax + b$
with $a \in \{-1,+1\}$ and $|b| \le N/n$ for some $1 \le n \le N$, then we see that the hypotheses
and conclusion of Theorem 1.7 are satisfied with the given value of $HP$ and some
absolute constants $d, C_d$.

The conclusion of this theorem gives quite a bit of information about $A$. For instance,
if $m$ is a natural number, we see from (1.2) that

$$HP^m \subset (A \cup \{1\} \cup A^{-1})^{C_d m n}$$

and from (1.3) and the triangle inequality we see that every element of
$(A \cup \{1\} \cup A^{-1})^{C_d m n}$ has a $\|\cdot\|_{HP,X}$ norm of at most $C_d^2 m$, which in particular
implies that such elements lie in $XHP^{C_d^2 m}$ (since $X$ contains the identity). Thus we have the
inclusions

$$HP^m \subset (A \cup \{1\} \cup A^{-1})^{C_d m n} \subset XHP^{C_d^2 m}, \qquad (1.4)$$

which show that the growth of $(A \cup \{1\} \cup A^{-1})^{mn}$ is essentially controlled by that of
$HP^m$. (This latter observation was already implicitly used in [9] to establish Theorem
1.5(ii).) The $m = 1$ case of (1.4) also shows (in the case when $A$ is a symmetric
neighbourhood of the identity, so that $A = A \cup \{1\} \cup A^{-1}$) that under the hypotheses
(1.2) and (1.3), the condition $|A^n| \le n^d |A|$ is equivalent (up to constants) to the lower
bound $|A| \ge n^{-d} |HP|$, giving a direct theorem to match the inverse theorem.

Using (1.4), together with a detailed analysis of the growth of coset nilprogressions
$HP^m$, we will be able to obtain the following result, which gives a more precise form of
Theorem 1.5(ii):

Theorem 1.9 (Further growth of a locally polynomially growing set). Let the notation
and hypotheses be as in Theorem 1.7. Then there exists a continuous piecewise linear
non-decreasing function $f : [0, +\infty) \to [0, +\infty)$, with $f(0) = 0$ and $f$ having at most $C_d$
distinct linear pieces, each of which has a slope that is a natural number not exceeding
$C_d$, such that

$$\left| \log |(A \cup \{1\} \cup A^{-1})^{mn}| - \log |(A \cup \{1\} \cup A^{-1})^{n}| - f(\log m) \right| \le C_d$$

for all natural numbers $m \ge 1$, where $C_d$ depends only on $d$.

We establish this theorem in Section 4. As remarked previously, the inclusions in (1.4)
were essentially already obtained in [9], so this result should be viewed more as a
complement to Theorem 1.7 than a consequence of it.


We illustrate Theorem 1.9 with two examples, one which shows that the slope of $f$ can
decrease over time, and the other showing that it can increase.
Example 1.10. Let $G$ be the additive group $G := \mathbb{Z}/N^3\mathbb{Z} \times \mathbb{Z}/N^3\mathbb{Z}$ for a large natural
number $N$, and let $A \subset G$ be the set $A := \{-N,\dots,N\} \times \{-N^2,\dots,N^2\}$. Then one
can compute that

$$\log |mA| = \log |A| + f(\log m) + O(1)$$

where $f(x)$ is equal to $2x$ for $0 \le x \le \log N$, equal to $x + \log N$ for $\log N \le x \le 2 \log N$,
and equal to $3 \log N$ for $x \ge 2 \log N$ (the value forced by continuity of $f$ at $x = 2 \log N$, and
by $|G| = N^6 \sim N^3 |A|$). This gives an example of Theorem 1.9 (with $n$ and
$d$ equal to suitable constants, independent of $N$) in which the slopes of $f(x)$ decrease
as $x$ increases.
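The growth profile in this example can be checked by direct computation for small $N$. The sketch below is illustrative only; the function names are hypothetical, and it takes the final plateau of $f$ to be $3 \log N$, the value forced by continuity at $x = 2 \log N$ and by $|G| = N^6$. It uses the fact that iterated sumsets of a product set factor coordinatewise.

```python
from math import log

def iterated_sumset_size_mod(A, M, m):
    """|mA| for a subset A of Z/MZ, by brute-force iteration of the sumset."""
    base = {a % M for a in A}
    current = set(base)
    for _ in range(m - 1):
        current = {(x + a) % M for x in current for a in base}
    return len(current)

def size_mA(N, m):
    """|mA| for A = {-N..N} x {-N^2..N^2} inside (Z/N^3 Z)^2."""
    M = N ** 3
    return (iterated_sumset_size_mod(range(-N, N + 1), M, m)
            * iterated_sumset_size_mod(range(-N * N, N * N + 1), M, m))

def f(x, N):
    """Piecewise linear profile of Example 1.10 with slopes 2, 1, 0."""
    L = log(N)
    if x <= L:
        return 2 * x
    if x <= 2 * L:
        return x + L
    return 3 * L

def max_deviation(N, m_max):
    """Largest |log|mA| - log|A| - f(log m)| over 1 <= m <= m_max."""
    size_A = size_mA(N, 1)
    return max(abs(log(size_mA(N, m)) - log(size_A) - f(log(m), N))
               for m in range(1, m_max + 1))

deviation = max_deviation(4, 20)
```

For $N = 4$ the deviation stays below $2$ over the whole range $1 \le m \le 20$, which covers all three linear pieces of $f$.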

Example 1.11. Let $G$ be the Heisenberg group
$G = \begin{pmatrix} 1 & \mathbb{Z} & \mathbb{Z} \\ 0 & 1 & \mathbb{Z} \\ 0 & 0 & 1 \end{pmatrix}$, let $N$ be a large
natural number, and let $A \subset G$ be the set consisting of those matrices
$\begin{pmatrix} 1 & a & b \\ 0 & 1 & c \\ 0 & 0 & 1 \end{pmatrix}$ with
$a, b, c \in \mathbb{Z}$, $|a|, |c| \le N$, and $|b| \le N^3$. One can then compute that for any natural
number $m$, $(A \cup \{1\} \cup A^{-1})^m$ is commensurate with the set of matrices
$\begin{pmatrix} 1 & a & b \\ 0 & 1 & c \\ 0 & 0 & 1 \end{pmatrix}$ with
$|a|, |c| \le mN$ and $|b| \le mN^3 + m^2 N^2$. As such one has

$$\log |(A \cup \{1\} \cup A^{-1})^m| = \log |A| + f(\log m) + O(1)$$

where $f(x)$ is equal to $3x$ for $0 \le x \le \log N$, and equal to $4x - \log N$ for $x \ge \log N$,
thus giving an example where the slopes of $f(x)$ increase as $x$ increases.

One can take a Cartesian product of these two examples (with different choices of $N$) to
produce an example of a function $f$ which is neither convex nor concave; we leave the
verification of this construction to the interested reader. We conjecture that an analogue
of Theorem 1.9 holds with $(A \cup \{1\} \cup A^{-1})^{mn}$ replaced by $A^{mn}$, but we will not pursue
this question here. Theorem 1.9 may also be compared with the result of Khovanskii
ca, which asserts that for any finite subset $A$ of an abelian group $G$, the cardinality
$|A^n|$ is a polynomial function of $n$ for sufficiently large $n$. For comparison, Theorem 1.9
shows that the function $m \mapsto |A^m|$ is comparable to a piecewise polynomial function of
$m$ for $m \ge n$, where the degree and number of pieces of this function, as well as the
comparability constants, are bounded by a constant depending only on $d$. The result
in [1])J has been extended to abelian semigroups (see fTS, IS]) and to virtually abelian
groups (see E2j); there are partial extensions to the virtually nilpotent case [3], but there
exist nilpotent groups for which $|A^n|$ is not eventually polynomial [23], although it is
always asymptotic to a polynomial [8].

Now we turn to the analogous problem for measures. Suppose that one is given a
discrete probability measure $\mu$ on a group $G = (G, \cdot)$, or equivalently a non-negative
function $\mu : G \to \mathbb{R}^+$ with $\sum_{x \in G} \mu(x) = 1$. The convolution $\mu * \nu : G \to \mathbb{R}^+$ of two such
probability measures, defined by

$$\mu * \nu(x) := \sum_{y \in G} \mu(y) \nu(y^{-1} x),$$
is again a probability measure. We denote by $\mu^{*n}$ the convolution of $n$ copies of $\mu$. Note
that if $\mu$ is symmetric (by which we mean that $\mu(x^{-1}) = \mu(x)$ for all $x \in G$), then $\mu^{*n}$
is symmetric for all $n$.

The quantity $\|\mu\|_{\ell^2(G)}^{-2} := (\sum_{x \in G} \mu(x)^2)^{-1}$, which is a quantity in the interval $[1, +\infty)$,
is a measure of how broadly the probability measure $\mu$ is supported. For instance, if
$\mu = \frac{1}{|A|} 1_A$ is uniform measure on a finite set $A$, then $\|\mu\|_{\ell^2(G)}^{-2}$ is equal to $|A|$. The
quantity $\|\mu^{*n}\|_{\ell^2(G)}^{-2}$ is then analogous to the quantity $|A^n|$ discussed previously.
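As an illustration of these definitions, the following sketch computes $\|\mu^{*n}\|_{\ell^2}^{-2}$ exactly for a finitely supported measure. It assumes the abelian group $G = \mathbb{Z}$ (so that $y^{-1}x$ becomes $x - y$) and stores a measure as a dictionary from points to rational masses; the function names are hypothetical.

```python
from collections import defaultdict
from fractions import Fraction

def convolve(mu, nu):
    """(mu * nu)(x) = sum_y mu(y) nu(y^{-1} x), written additively on Z."""
    out = defaultdict(Fraction)
    for y, p in mu.items():
        for z, q in nu.items():
            out[y + z] += p * q
    return dict(out)

def inverse_l2_sq(mu):
    """The quantity ||mu||_{l^2}^{-2} = (sum_x mu(x)^2)^{-1}."""
    return 1 / sum(p * p for p in mu.values())

def uniform(A):
    """Uniform probability measure on a finite set A."""
    A = set(A)
    return {a: Fraction(1, len(A)) for a in A}

def spread_profile(mu, n_max):
    """Return [||mu^{*n}||_{l^2}^{-2} for n = 1, ..., n_max]."""
    out, power = [], dict(mu)
    for _ in range(n_max):
        out.append(inverse_l2_sq(power))
        power = convolve(power, mu)
    return out

mu = uniform(range(-3, 4))       # uniform measure on A = {-3, ..., 3}
profile = spread_profile(mu, 5)  # first entry equals |A| = 7
```

The profile starts at $|A|$ for the uniform measure and increases with $n$, mirroring the growth of $|A^n|$.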

From Young’s inequality we see that the quantity $\|\mu^{*n}\|_{\ell^2(G)}^{-2}$ is non-decreasing in $n$. The
following inverse theorem, analogous to Theorem 1.7, describes those measures $\mu$ for
which $\|\mu^{*n}\|_{\ell^2(G)}^{-2}$ grows at most polynomially in $n$:

Theorem 1.12 (Inverse theorem for polynomial growth of measures). Let $\mu$ be a
symmetric probability measure on a group $G$, let $d > 0$ and $\varepsilon > 0$, and suppose that

$$\|\mu^{*n}\|_{\ell^2(G)}^{-2} \le n^d \|\mu\|_{\ell^2(G)}^{-2} \qquad (1.5)$$

for some $n$ that is sufficiently large depending on $d$ and $\varepsilon$. Then there exists a coset
nilprogression $HP$ of rank and nilpotency class at most $C_{d,\varepsilon}$ in $C_{d,\varepsilon}$-normal form, and
a finite subset $X$ of $G$ of cardinality at most $C_{d,\varepsilon}$ containing the identity, such that

$$|HP| \le C_{d,\varepsilon} \|\mu^{*n}\|_{\ell^2(G)}^{-2} \qquad (1.6)$$

and such that

$$\int_{G \setminus E} \|x\|_{HP,X}^2 \, d\mu(x) \le \frac{C_{d,\varepsilon}}{n} \qquad (1.7)$$

for some exceptional set $E$ with

$$\mu(E) \le \frac{C_{d,\varepsilon}}{n^{1-\varepsilon}}. \qquad (1.8)$$

Here $C_{d,\varepsilon}$ is a quantity depending only on $d$ and $\varepsilon$.



We prove this theorem in Section 5. Given that Theorem 1.7 does not require symmetry,
it is reasonable to expect that some form of Theorem 1.12 can also be established for
non-symmetric $\mu$, but we do not pursue this issue here. Some exceptional set $E$ must
be permitted in the above theorem; indeed, given a symmetric probability measure $\mu$,
one can consider the modified measure $\mu' := (1 - \delta)\mu + \delta\nu$ for an arbitrary probability
measure $\nu$ and a small $\delta > 0$, and then

$$\|(\mu')^{*n}\|_{\ell^2}^{-2} \le (1 - \delta)^{-2n} \|\mu^{*n}\|_{\ell^2}^{-2},$$

which shows that one can modify $\mu$ more or less arbitrarily on a set of measure $O(1/n)$
without significantly increasing the quantity $\|\mu^{*n}\|_{\ell^2}^{-2}$. However, the $n^\varepsilon$ loss in (1.8) is
somewhat undesirable (as we shall see, it matches the $n^\varepsilon$ loss in Theorem 1.3), and it
may be possible to remove it at the cost of making the conclusion more complicated.


For the corresponding direct theorem, we have the following result, which is a converse 
to the above theorem if we remove the exceptional set E. 
Theorem 1.13 (Direct theorem for polynomial growth of measures). Let $\mu$ be a discrete
symmetric probability measure on a group $G$. Let $HP$ be a coset nilprogression in $G$ of
rank and nilpotency class bounded by $M$, in $M$-normal form, let $X$ be a non-empty set
of cardinality at most $M$, and suppose that

$$\int_G \|g\|_{HP,X}^2 \, d\mu(g) \le \frac{M}{n}.$$

Then one has

$$\|\mu^{*n}\|_{\ell^2(G)}^{-2} \le C_M |HP|$$

for some quantity $C_M$ depending only on $M$.


We prove this result in Section 6; it is essentially a quantitative analysis of random
walks on virtually nilpotent groups. Under additional hypotheses on $X$ and $\mu$, it is
possible that one could (in the spirit of 0) obtain some sort of “central limit theorem”
that describes $\mu^{*n}$ more precisely, but we do not pursue this issue here.

When the group $G = (G, +)$ is abelian, one can use Theorem 1.12 to recover the inverse
Littlewood-Offord theorem in Theorem 1.3; we do this in Section 7.

Now we turn to the question of obtaining non-abelian analogues of Littlewood-Offord
theory. This question was recently investigated by Tiep and Vu [2?j. We mention just
one of their main results:

Theorem 1.14. Let $m, n, s \ge 2$ be integers, and let $A_1,\dots,A_n$ be matrices in $SL_m(\mathbb{C})$,
each of which has order at least $s$. Let $\tilde A_1,\dots,\tilde A_n$ be the independent random matrices
selected by choosing $\tilde A_i$ to equal $A_i$ or $A_i^{-1}$ with equal probability, for each $i = 1,\dots,n$.
Then

$$\sup_{B \in SL_m(\mathbb{C})} \mathbf{P}(\tilde A_1 \dots \tilde A_n = B) \le \frac{141}{\min(s, \sqrt{n})}.$$

While our methods cannot recover this type of result exactly, we can obtain the following
related result involving a “symmetrised” form of the Littlewood-Offord problem, proven
in Section 8.

Theorem 1.15. Let $n \ge 2$ and $0 < \varepsilon < 1$. Let $A_1,\dots,A_n$ be elements of a group
$G = (G, \cdot)$, and let $A'_1,\dots,A'_n$ be the independent identically distributed random variables
with each $A'_i$ selected to equal one of $A_1,\dots,A_n, A_1^{-1},\dots,A_n^{-1}$ with equal probability. If

$$\sup_{B \in G} \mathbf{P}(A'_1 \dots A'_n = B) \ge \frac{1}{\varepsilon \sqrt{n}} \qquad (1.9)$$

and $n \ge C/\varepsilon^4$ for a sufficiently large absolute constant $C$, then there exists a finite
subgroup $H$ of $G$ of order at most $C \varepsilon \sqrt{n}$ which contains at least $(1 - C\varepsilon^2) n$ of the $A_j$.


Note that if the $A_i$ are as in Theorem 1.14, then none of the $A_i$ can be contained in any
subgroup of order less than $s$, so from Theorem 1.15 applied in the contrapositive we
see that

$$\sup_{B \in G} \mathbf{P}(A'_1 \dots A'_n = B) \le \frac{C}{\min(s, \sqrt{n})}$$

for some absolute constant $C$, at least in the regime when $s \ge C n^{1/4}$ (actually one can
replace the exponent $1/4$ here by any other positive constant, as can be seen from the
argument below). Thus our theorem differs from Theorem 1.14 in that the constant $C$
is not explicit, and we can only control the random walk $A'_1 \dots A'_n$ as opposed to the
ordered random product $\tilde A_1 \dots \tilde A_n$, and one needs some lower bound on $s$.


Let $\mu$ be a finitely supported symmetric probability measure on a group $G$. By results
of Gromov and Varopoulos (see e.g. [30]), it is known that $\|\mu^{*n}\|_{\ell^\infty(G)}$ decays at a rate
$\gg n^{-d/2}$ if and only if $\mu$ is supported in a subgroup $G'$ of $G$ of polynomial volume growth
of degree at most $d$ (thus $|S^n| \le C_S n^d$ for all finite sets $S$ in $G'$ and all $n$).
By modifying the proof of Theorem 1.15 we can obtain a non-asymptotic variant of
this result:


Theorem 1.16. Let $d$ be a natural number and let $\varepsilon > 0$. Let $n$ be a natural number
that is sufficiently large depending on $d, \varepsilon$, and let $\mu$ be a symmetric probability measure
on a group $G$. Suppose that

$$\|\mu^{*n}\|_{\ell^\infty(G)} \ge n^{-(d+1-\varepsilon)/2};$$

then there exists a subgroup $G'$ of $G$ of polynomial volume growth of degree at most $d$
such that $\mu(G') \ge 1 - \varepsilon$.


We prove this result in Section 8.

Breuillard and Melvyn Nathanson for help with the references. We also thank the

1.3. Notation. We will rely heavily on definitions, notations, and theorems from the
paper [6], which also established Theorems 1.4 and 1.5(i) above. As such, some
familiarity with that paper will probably be required in order to easily follow the arguments
given here.


2. Nonstandard analysis formulation 

To remove some of the “epsilon management” in the arguments, and also to more
easily access some results from [6] that are phrased in a nonstandard setting, we will
convert the main results of our paper to a nonstandard formulation. We will use the
nonstandard framework based on a single non-principal ultrafilter $\alpha \in \beta\mathbb{N} \setminus \mathbb{N}$, as laid
out in [6, Appendix A], and will freely use the notation from that appendix in the
sequel. In particular, we have the asymptotic notation $X = O(Y)$, $X \ll Y$, or $Y \gg X$

4 We thank Emmanuel Breuillard for suggesting this variant. 
when $|X| \le CY$ for some standard $C$, and $X = o(Y)$ if one has $|X| \le \varepsilon Y$ for every
standard $\varepsilon > 0$. We say that a quantity $X$ is bounded if $X = O(1)$, and write $X \sim Y$
for $X \ll Y \ll X$.

We define a nonstandard group (or internal group) to be an ultraproduct
$G = \prod_{\mathbf{n} \to \alpha} G_{\mathbf{n}}$ of (standard) groups $G_{\mathbf{n}}$, a nonstandard finite set (or internal
finite set) to be an ultraproduct $A = \prod_{\mathbf{n} \to \alpha} A_{\mathbf{n}}$ of (standard) finite sets $A_{\mathbf{n}}$, and so
forth. Note that an internal finite set $A$ has an internal cardinality
$|A| = \lim_{\mathbf{n} \to \alpha} |A_{\mathbf{n}}|$, which is a nonstandard finite number.

We define an ultra approximate group to be an ultraproduct $A = \prod_{\mathbf{n} \to \alpha} A_{\mathbf{n}}$ of
(standard) sets $A_{\mathbf{n}}$, which are all $K$-approximate groups for some standard number $K$
independent of $\mathbf{n}$. Similarly, define an ultra coset nilprogression to be an ultraproduct
$HP = \prod_{\mathbf{n} \to \alpha} H_{\mathbf{n}} P_{\mathbf{n}}$ of (standard) coset nilprogressions $H_{\mathbf{n}} P_{\mathbf{n}}$ whose rank and nilpotency
class are bounded uniformly in $\mathbf{n}$; thus $HP$ itself will have rank and nilpotency class
which are standard natural numbers. If $g \in G$, then $\|g\|_{HP}$ is well-defined as a
nonstandard element of $[0, +\infty]$; similarly for $\|g\|_{HP,S}$ if $S$ is a nonstandard finite set. We define
the notion of an ultra nilprogression similarly (discarding the finite group $H$).

An ultra nilprogression $P(u_1,\dots,u_r; N_1,\dots,N_r)$ (with $N_1,\dots,N_r$ now nonstandard
reals) is said to be in normal form if it obeys the following axioms:


(i) (Upper-triangular form) For every $1 \le i < j \le r$ and for all four choices of signs
$\pm$ one has

$$[u_i^{\pm 1}, u_j^{\pm 1}] \in P\left(u_{j+1},\dots,u_r; O\left(\frac{N_{j+1}}{N_i N_j}\right),\dots,O\left(\frac{N_r}{N_i N_j}\right)\right).$$

(ii) (Local properness) The expressions $u_1^{n_1} \dots u_r^{n_r}$ are distinct as $n_1,\dots,n_r$ range
over the nonstandard integers with $n_i = o(N_i)$ for $i = 1,\dots,r$.

(iii) (Volume bound) One has

$$|P| \sim \prod_{i=1}^r (2\lfloor N_i \rfloor + 1).$$


An ultra coset nilprogression is said to be in normal form if its associated ultra
nilprogression is in normal form.

Theorem 1.7 then follows from (and is in fact equivalent to) the following nonstandard
analysis statement.

Theorem 2.1 (Inverse theorem for polynomial growth, nonstandard formulation). Let
$A$ be a non-empty internally finite subset of a nonstandard group $G$, let $n$ be an
unbounded natural number, and suppose that $|A^n| \le n^{O(1)} |A|$. Then there exists an ultra
coset nilprogression $HP$ in normal form, and a finite subset $X$ of $G$ of bounded
cardinality containing the identity, such that

$$HP \subset (A \cup \{1\} \cup A^{-1})^{O(n)} \qquad (2.1)$$

and such that $\|g\|_{HP,X} = O(1/n)$ for all $g \in A$.
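Let us see how Theorem 2.1 implies Theorem 1.7. Suppose for contradiction that
Theorem 1.7 fails. Carefully negating the quantifiers, we conclude that there exists
$d > 0$, a sequence $G_{\mathbf{n}}$ of (standard) groups, finite non-empty subsets $A_{\mathbf{n}}$ of $G_{\mathbf{n}}$, and a
sequence $n_{\mathbf{n}}$ of natural numbers going to infinity such that

$$|A_{\mathbf{n}}^{n_{\mathbf{n}}}| \le n_{\mathbf{n}}^d |A_{\mathbf{n}}|$$

for all $\mathbf{n}$, but such that for each $\mathbf{n}$, there does not exist a coset nilprogression $H_{\mathbf{n}} P_{\mathbf{n}}$ in $G_{\mathbf{n}}$
of rank and nilpotency class at most $\mathbf{n}$ in $\mathbf{n}$-normal form and a finite subset $X_{\mathbf{n}}$ of $G_{\mathbf{n}}$ of
cardinality at most $\mathbf{n}$ containing the identity such that

$$H_{\mathbf{n}} P_{\mathbf{n}} \subset (A_{\mathbf{n}} \cup \{1\} \cup A_{\mathbf{n}}^{-1})^{\mathbf{n} n_{\mathbf{n}}}$$

and

$$A_{\mathbf{n}} \cup \{1\} \cup A_{\mathbf{n}}^{-1} \subset \left\{ g_{\mathbf{n}} \in G_{\mathbf{n}} : \|g_{\mathbf{n}}\|_{H_{\mathbf{n}} P_{\mathbf{n}}, X_{\mathbf{n}}} \le \frac{\mathbf{n}}{n_{\mathbf{n}}} \right\}.$$

We now take ultraproducts, forming the nonstandard group $G := \prod_{\mathbf{n} \to \alpha} G_{\mathbf{n}}$ and the
internally finite subset $A := \prod_{\mathbf{n} \to \alpha} A_{\mathbf{n}}$, and the nonstandard natural number
$n := \lim_{\mathbf{n} \to \alpha} n_{\mathbf{n}}$. By hypothesis, $n$ is unbounded and $|A^n| \le n^d |A|$. Thus by Theorem 2.1
there exists an ultra coset nilprogression $HP$ in normal form and a finite subset $X$ of $G$
of bounded cardinality containing the identity, such that

$$HP \subset (A \cup \{1\} \cup A^{-1})^{Cn}$$

and such that $\|g\|_{HP,X} \le C/n$ for all $g \in A$, and some standard $C$. Writing
$HP = \prod_{\mathbf{n} \to \alpha} H_{\mathbf{n}} P_{\mathbf{n}}$ and using Łoś’s theorem (see e.g. [6, Appendix A]), we see that for $\mathbf{n}$
sufficiently close to $\alpha$ (and enlarging $C$ if necessary), $H_{\mathbf{n}} P_{\mathbf{n}}$ is a coset nilprogression of
rank and nilpotency class at most $C$, in $C$-normal form, with

$$H_{\mathbf{n}} P_{\mathbf{n}} \subset (A_{\mathbf{n}} \cup \{1\} \cup A_{\mathbf{n}}^{-1})^{C n_{\mathbf{n}}}$$

and such that $\|g_{\mathbf{n}}\|_{H_{\mathbf{n}} P_{\mathbf{n}}, X_{\mathbf{n}}} \le C/n_{\mathbf{n}}$ for all $g_{\mathbf{n}} \in A_{\mathbf{n}}$. But this contradicts the construction
of the $A_{\mathbf{n}}$ for $\mathbf{n}$ large enough.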

3. Inverse theorem for polynomial growth of sets 

We now prove Theorem 2.1. Let $A$ and $n$ be as in that theorem. Since $|A^n| \ll n^{O(1)}|A|$ and $n$ is unbounded, we see from the pigeonhole principle that there exists an unbounded $n_0 \ll n$ such that $|A^{100 n_0}| \ll |A^{n_0}|$. As we will see at the end of the argument, it would be convenient if we could take $n_0 \sim n$, and from Theorem 1.5(ii) we see that we may do so when $A$ is symmetric and contains the identity; however, we are not assuming symmetry on $A$, and so we will have to temporarily allow for the possibility that $n_0$ is much less than $n$, and return to address this issue at the end of the argument.
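The pigeonhole step above can be unpacked as follows. This is only a sketch under stated assumptions: the hypothesis is written as $|A^n| \leq n^d |A|$ for a standard $d > 0$, and the threshold $K := 100^{4d}$ and the scales $100^j$ are illustrative choices, not taken from the paper.

```latex
% Sketch: d is the standard exponent in |A^n| <= n^d |A|; K := 100^{4d} is an assumed threshold.
Let $J$ be the largest integer with $100^J \le n$, so that $J$ is unbounded, and suppose
for contradiction that $|A^{100^{j+1}}| > K\,|A^{100^{j}}|$ for every
$\lceil J/2 \rceil \le j < J$. Since $m \mapsto |A^m|$ is non-decreasing, chaining these
$\lfloor J/2 \rfloor$ inequalities gives
\[
  |A^n| \;\ge\; |A^{100^{J}}|
        \;>\; K^{\lfloor J/2 \rfloor}\,|A^{100^{\lceil J/2 \rceil}}|
        \;\ge\; 100^{2d(J-1)}\,|A|
        \;\ge\; 100^{d(J+1)}\,|A|
        \;>\; n^d\,|A| ,
\]
using $J \ge 3$ and $n < 100^{J+1}$. This contradicts $|A^n| \le n^d |A|$, so some
$n_0 := 100^{j}$ with $\lceil J/2 \rceil \le j < J$ obeys
$|A^{100 n_0}| \le K\,|A^{n_0}|$; this $n_0$ is unbounded and at most $n$.
```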

From [25, Corollary 3.11] we see that $(A^{n_0} \cup \{1\} \cup A^{-n_0})^3$ is an ultra approximate group of cardinality $\sim |A^{n_0}|$. In particular we have $|(A^{n_0} \cup \{1\} \cup A^{-n_0})^m| \sim |A^{n_0}|$ for any standard $m$.

If we now applied [6, Theorem 4.2], we could conclude that the ultra approximate group $(A^{n_0} \cup \{1\} \cup A^{-n_0})^{12}$ contained an ultra coset progression $HP$ in normal form with $|HP| \sim |A^{n_0}|$. However it will be convenient to impose an additional "N-properness" hypothesis on $P$ that strengthens the local properness property of normal form; this strengthening is not explicitly provided in [6, Theorem 4.2], and so we will repeat some of the arguments in [6] to obtain this refinement:

Proposition 3.1. There exists an ultra coset nilprogression $HP \subset (A^{n_0} \cup \{1\} \cup A^{-n_0})^{O(1)}$ with $|HP| \sim |A^{n_0}|$ such that the associated ultra nilprogression $P(u_1, \dots, u_r; N_1, \dots, N_r) = HP/H$ in normal form obeys the additional property:

(ii') (N-properness) The group elements $u_1^{n_1} \dots u_r^{n_r}$ with $n_1 = O(N_1), \dots, n_r = O(N_r)$ are all distinct.
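As a finite toy model of the N-properness condition, the following sketch tests whether distinct exponent tuples give distinct elements for a progression in the additive group of integers. It is an illustration only: the function names and the fixed `scale` parameter (standing in for one choice of the constant in $O(N_i)$) are assumptions, and the abelian setting ignores the nilpotent structure used in the paper.

```python
from itertools import product

def progression_elements(generators, lengths, scale=1):
    """Map each exponent tuple (n_1, ..., n_r) with |n_i| <= scale * N_i
    to the element n_1*u_1 + ... + n_r*u_r of the additive group Z."""
    ranges = [range(-scale * N, scale * N + 1) for N in lengths]
    return {
        exps: sum(n * u for n, u in zip(exps, generators))
        for exps in product(*ranges)
    }

def is_proper(generators, lengths, scale=1):
    """True if distinct exponent tuples always give distinct group elements."""
    elements = progression_elements(generators, lengths, scale)
    return len(set(elements.values())) == len(elements)
```

For instance, with generators `(1, 10)` and lengths `(4, 4)` there are no collisions at `scale=1`, but at `scale=2` the tuples `(5, 0)` and `(-5, 1)` both give the element 5. This is why properness at every standard scale is a genuine strengthening.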


Proof. We first apply [6, Theorem 10.10] to conclude the existence of an ultra approximate group $\tilde A \subset (A^{n_0} \cup \{1\} \cup A^{-n_0})^{O(1)}$ with $|\tilde A| \sim |A^{n_0}|$ which enjoys a global model $\phi : \langle \tilde A \rangle \to L$ (as defined before [6, Proposition 6.10]) into a connected, simply connected nilpotent Lie group $L$. Here and in the sequel $\langle \tilde A \rangle$ denotes the external group generated by $\tilde A$, that is to say the set of all words in $\tilde A$ of bounded length. The crucial property here is that $L$ is simply connected; in [6, Theorem 10.10], this is basically accomplished by quotienting out a maximal compact subgroup from a preliminary Lie model for $\langle \tilde A \rangle$.

If one now applies [6, Theorem 4.2] to $\tilde A$, then we see that $\tilde A^4$ contains an ultra coset nilprogression $HP$ in normal form commensurable⁵ with $\tilde A^4$ and hence of cardinality $\sim |A^{n_0}|$. Furthermore, an inspection of the proof (using $\phi$ as the Lie model) reveals that $H$ lies in the kernel of $\phi$ (as $L$, being simply connected nilpotent, has no non-trivial compact subgroups), and so by abuse of notation we may also define $\phi$ on the quotient space $\langle HP \rangle / H$; also, the image of $HP$ will be an open neighbourhood of $L$. Finally, the rank $r$ of $HP$ does not exceed the dimension of the Lie model $L$. It remains to establish the N-properness, which will ultimately be a consequence of the simply connected nature of $L$.


Suppose for contradiction that we have a collision

$$u_1^{n_1} \dots u_r^{n_r} = u_1^{n'_1} \dots u_r^{n'_r}$$

for some $(n_1, \dots, n_r) \neq (n'_1, \dots, n'_r)$ with $n_i, n'_i = O(N_i)$ for $i = 1, \dots, r$. Cancelling off the $u_1$ factors if $n_1 = n'_1$, then the $u_2$ factors if $(n_1, n_2) = (n'_1, n'_2)$, and so forth, and then moving all copies of the remaining $u_i$ to one side using the identity

$$gh = hg[g,h] \tag{3.1}$$

and simplifying using repeated application of the upper triangular form condition (i), we eventually conclude that

$$u_i^{m_i} \dots u_r^{m_r} = 1$$

for some $1 \leq i \leq r$ and $m_i, \dots, m_r$ with $m_j = O(N_j)$ for $i \leq j \leq r$ and $m_i$ positive.

Because of this and further repeated application of the upper triangular form condition, we see that for any other $a_1, \dots, a_r$ with $a_j = O(N_j)$ for $1 \leq j \leq r$, we can write

$$u_1^{a_1} \dots u_r^{a_r} = u_1^{a_1} \dots u_{i-1}^{a_{i-1}} u_i^{b_i} \dots u_r^{b_r} \tag{3.2}$$


5 We say that two symmetric sets of a group are commensurable if each set can be covered by a 
bounded number of left-translates of the other. 





where $b_j = O(N_j)$ for $i \leq j \leq r$ and $0 \leq b_i < m_i$. Meanwhile, by the arguments in [6, §9], we see that for each $1 \leq j \leq r$ there is a one-parameter subgroup $t \mapsto \exp(tX_j)$ in $L$ such that

$$\phi(u_j^{n_j}) = \exp\left( \left( \mathrm{st}\, \frac{n_j}{N_j} \right) X_j \right)$$

for all $n_j = O(N_j)$. From this, the normal form property (i) composed with $\phi$, and the simply connected nilpotent nature of $L$ (which makes the exponential map a diffeomorphism, and the Baker-Campbell-Hausdorff formula globally valid) we conclude in particular that $[X_j, X_k]$ lies in the linear span of $X_{k+1}, \dots, X_r$ in the Lie algebra of $L$ for all $1 \leq j < k \leq r$. From this and (3.2) we conclude that every element in $\phi(\langle HP \rangle)$ takes the form

$$\exp(t_1 X_1) \dots \exp(t_r X_r) \tag{3.3}$$

with $t_j \in \mathbb{R}$ for all $1 \leq j \leq r$, with $0 \leq t_i \leq \mathrm{st}\, \frac{m_i}{N_i}$. As $\phi(HP)$ is an open neighbourhood of $L$ and $L$ is connected, $\phi$ must be surjective on $\langle HP \rangle$. Since $r$ cannot exceed the dimension of $L$, this forces $r$ to in fact be equal to the dimension of $L$, and the $X_1, \dots, X_r$ to be linearly independent. However, due to the limitation on $t_i$ and the upper triangular nature of the Lie brackets $[X_j, X_k]$ mentioned above, we see that not every element of $L$ is of the form (3.3) (for instance, $\exp(t_i X_i)$ is not of this form if $t_i$ is negative or larger than $\mathrm{st}\, \frac{m_i}{N_i}$), giving the required contradiction. □


Let $HP$ be as above. As $(A^{n_0} \cup \{1\} \cup A^{-n_0})^3$ is commensurable with $HP$, it can be covered by a bounded number of left-cosets of the group $\langle HP \rangle$; if we let $\pi : \langle A^{n_0} \rangle \to \langle A^{n_0} \rangle / \langle HP \rangle$ be the quotient map, we thus see that $\pi(A^{n_0})$ has bounded cardinality. On the other hand $\pi(A^m)$ is non-decreasing in $m$. By the pigeonhole principle, we may thus find $1 \leq n_1 < n_0$ such that $\pi(A^{n_1}) = \pi(A^{n_1+1})$. There thus exists a finite set $X \subset \langle A^{n_0} \rangle$ (of bounded cardinality) such that $A^{n_1+1} \subset X \langle HP \rangle$, and such that the cosets $x \langle HP \rangle$ for $x \in X$ are disjoint and all have non-empty intersection with $A^{n_1}$. Since

$$A^{n_1} \langle HP \rangle = A^{n_1+1} \langle HP \rangle = X \langle HP \rangle$$

we see that left multiplication by any element $a$ of $A$ preserves $X \langle HP \rangle$, and so for each $a \in A$ there is a unique permutation $\sigma_a : X \to X$ with the property that

$$ax \langle HP \rangle = \sigma_a(x) \langle HP \rangle \tag{3.4}$$

for all $x \in X$.


We now apply an argument of Sanders [22]. Since $|(A^{n_0} \cup \{1\} \cup A^{-n_0})^m| \sim |A^{n_0}|$ for any standard $m$, we have

$$|A^{10 n_0}| \ll |A^{n_0}|.$$

Thus, by the pigeonhole principle, we may find $n_0 \leq n_2 \leq 2n_0$ such that

$$|A^{n_2+1}| \leq \left( 1 + O\left( \frac{1}{n_0} \right) \right) |A^{n_2}|,$$

thus if we write $B := A^{n_2}$ then

$$|BA| \leq \left( 1 + O\left( \frac{1}{n_0} \right) \right) |B|. \tag{3.5}$$

We now introduce a nonstandard semi-metric $d : G \times G \to {}^*[0,2]$ on $G$ by the formula

$$d(x,y) := \frac{|Bx \,\Delta\, By|}{|B|} \tag{3.6}$$

where $\Delta$ denotes the symmetric (set-theoretic) difference; then $d$ is non-negative, right-invariant (thus $d(xg, yg) = d(x,y)$ for all $g \in G$), symmetric, and obeys the triangle inequality. From (3.5) (and the fact that $A$ contains the identity) we see that

$$d(a, 1) \ll \frac{1}{n_0} \tag{3.7}$$

for all $a \in A$.
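The semi-metric (3.6) and the bound (3.7) can be checked by hand in a small abelian toy case. The sketch below is an assumption-laden illustration, not the paper's setting: it takes $G = \mathbb{Z}$ in additive notation, $A = \{0, 1\}$ and $B = A^{99} = \{0, \dots, 99\}$, and the helper names `translate` and `d` are hypothetical.

```python
from fractions import Fraction

def translate(B, x):
    """Right-translate B by x in the additive group Z (a stand-in for Bx)."""
    return frozenset(b + x for b in B)

def d(B, x, y):
    """Semi-metric d(x, y) = |Bx (symmetric difference) By| / |B|, as an exact fraction."""
    return Fraction(len(translate(B, x) ^ translate(B, y)), len(B))

A = {0, 1}                                    # toy generating set containing the identity 0
B = frozenset(range(100))                     # B = A^{99} in additive notation
BA = frozenset(b + a for b in B for a in A)   # the product set BA = A^{100}
```

Here `len(BA) - len(B)` equals 1, so $B$ barely grows under multiplication by $A$, and correspondingly `d(B, 1, 0)` is small: the two translates of $B$ differ only at their endpoints. Translates that do not meet at all are at the maximal distance 2.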


To use this, we study the geometry of the semi-metric $d$ on the coset space $\mathcal{X} := \{ x \langle HP \rangle : x \in X \}$. We would like to define the nonstandard distances

$$d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle) := \inf \{ d(y, y') : y \in x \langle HP \rangle; y' \in x' \langle HP \rangle \}$$

for any $x, x' \in X$. Here we run into a technical problem in that the infimum is not automatically defined since the set on the right-hand side is external. However, we can fix this as follows. First, using right-invariance, we may (formally) write

$$d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle) = \inf \{ d(x, y') : y' \in x' \langle HP \rangle \}.$$

Next, observe that if $d(x, y') < 2$, then from (3.6) $Bx$ and $By'$ intersect, so $y'$ lies in $B^{-1}Bx$; since

$$\begin{aligned}
|B^{-1} B x HP| &= |A^{-n_2} A^{n_2} x HP| \\
&\leq |A^{-2n_0} A^{2n_0} X HP| \\
&\leq |(A^{-n_0} \cup \{1\} \cup A^{n_0})^{O(1)}| \\
&\ll |HP|
\end{aligned} \tag{3.8}$$

we see that $B^{-1}Bx$ is covered by boundedly many left translates of $HP$. In particular, $B^{-1}Bx \cap x' \langle HP \rangle$ is contained in $x' HP^{m'}$ for some standard $m'$ (which can be made uniform in $x, x'$, since there are only boundedly many choices for these parameters), thus we may (formally) write

$$d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle) = \inf \{ d(x, y') : y' \in x' HP^{m'} \} \cup \{2\}.$$

We take this as the definition of $d_{\mathcal{X}}$, and then one easily verifies that the previous two formulae for $d_{\mathcal{X}}$ are also valid (interpreting the infimum as the greatest lower bound). This makes $d_{\mathcal{X}}$ a well-defined nonstandard semi-metric on $\mathcal{X}$.

By construction, we have $d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle) \leq d(x, x')$ for all $x, x' \in X$. Unfortunately, it is possible for $d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle)$ to be significantly smaller than $d(x, x')$, which will be undesirable for our purposes. Fortunately, we can fix this by exploiting the "gauge freedom" to multiply each $x \in X$ (other than the identity, which we wish to keep in $X$) on the right by an arbitrary element of $\langle HP \rangle$, which does not affect the cosets in $\mathcal{X}$ or the metric $d_{\mathcal{X}}$:

Lemma 3.2. After right-multiplying each $x \in X$ (other than the identity) by an element of $\langle HP \rangle$, we can ensure that

$$d(x, x') \sim d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle)$$

for all $x, x' \in X$.


Proof. We enumerate $X$ as $x_1, \dots, x_m$ with $x_1 = 1$, so that cosets in $\mathcal{X}$ are enumerated as $C_1, \dots, C_m$ with $C_i := x_i \langle HP \rangle$. We form a spanning tree on these cosets by connecting each $C_i$, $1 < i \leq m$, to the $C_j$, $1 \leq j < i$ that minimises the distance $d_{\mathcal{X}}(C_i, C_j)$; we refer to this $j$ as the "parent" of $i$. (If there is more than one $j$ that minimises the distance, break the tie arbitrarily.) This is clearly a spanning tree. We claim the following property: if $1 \leq i < j \leq m$, then $d_{\mathcal{X}}(C_i, C_j)$ is comparable to the length of the path $C_i = C_{k_1}, C_{k_2}, \dots, C_{k_l} = C_j$ connecting $C_i$ to $C_j$ in the spanning tree, thus

$$d_{\mathcal{X}}(C_{k_1}, C_{k_2}) + \dots + d_{\mathcal{X}}(C_{k_{l-1}}, C_{k_l}) \sim d_{\mathcal{X}}(C_i, C_j). \tag{3.9}$$

The lower bound follows from the triangle inequality. To prove the upper bound, we assume inductively that the claim has already been established for smaller values of $j$. If $C_{k_{l-1}} = C_i$ then the claim is trivial, so suppose that $C_{k_{l-1}} \neq C_i$. As $k_{l-1}$ is the parent of $k_l = j$, we see from construction of the spanning tree we have $d_{\mathcal{X}}(C_{k_{l-1}}, C_{k_l}) \leq d_{\mathcal{X}}(C_i, C_j)$, so by the triangle inequality $d_{\mathcal{X}}(C_i, C_{k_{l-1}}) \ll d_{\mathcal{X}}(C_i, C_j)$. From the induction hypothesis we have

$$d_{\mathcal{X}}(C_{k_1}, C_{k_2}) + \dots + d_{\mathcal{X}}(C_{k_{l-2}}, C_{k_{l-1}}) \ll d_{\mathcal{X}}(C_i, C_{k_{l-1}}) \ll d_{\mathcal{X}}(C_i, C_j)$$

and the claim follows.

Proceeding recursively from $C_2$ to $C_m$, we may now right-multiply each $x_i$, $2 \leq i \leq m$ by an element of $\langle HP \rangle$ such that

$$d_{\mathcal{X}}(C_i, C_{j_i}) = d(x_i, x_{j_i})$$

for all $2 \leq i \leq m$, where $j_i$ denotes the parent of $i$. The claim now follows from (3.9) and the triangle inequality. □
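The spanning tree used in this proof is easy to build explicitly. The following sketch is a hedged illustration with hypothetical names, using 0-based indices and an ordinary finite distance matrix in place of the nonstandard semi-metric: each vertex is attached to the nearest earlier vertex, and the length of the tree path between two vertices is then compared with their direct distance.

```python
def build_parents(dist):
    """parents[i] is the index j < i minimising dist[i][j]; the root 0 has no parent."""
    parents = [None]
    for i in range(1, len(dist)):
        parents.append(min(range(i), key=lambda j: dist[i][j]))
    return parents

def ancestors(parents, i):
    """Chain i, parent(i), parent(parent(i)), ..., ending at the root."""
    chain = [i]
    while parents[chain[-1]] is not None:
        chain.append(parents[chain[-1]])
    return chain

def tree_path_length(dist, parents, i, j):
    """Sum of edge lengths along the unique tree path from i to j."""
    up_i, up_j = ancestors(parents, i), ancestors(parents, j)
    common = next(a for a in up_i if a in up_j)      # lowest common ancestor
    path = up_i[:up_i.index(common) + 1] + up_j[:up_j.index(common)][::-1]
    return sum(dist[a][b] for a, b in zip(path, path[1:]))

# Toy data: four points on a line, with the usual distance.
points = [0, 10, 1, 11]
dist = [[abs(p - q) for q in points] for p in points]
parents = build_parents(dist)
```

In this toy example the tree path between the last two points has length 12 while their direct distance is 10, which illustrates the comparability asserted in (3.9) with a modest constant.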

Henceforth we let $X$ be chosen to obey the conclusion of the above lemma. Let $a \in A$ and $x \in X$. By (3.4) we have

$$ax = x'g$$

for some $g \in \langle HP \rangle$ and $x' := \sigma_a(x)$. From the above lemma, right-invariance, and (3.7), we have

$$\begin{aligned}
d(x, x') &\ll d_{\mathcal{X}}(x \langle HP \rangle, x' \langle HP \rangle) \\
&\leq d(x, x'g) \\
&= d(x, ax) \\
&= d(1, a) \\
&\ll \frac{1}{n_0}.
\end{aligned}$$

By the triangle inequality and right invariance, we then have

$$d(x', x'g) \leq d(x', x) + d(x, ax) \ll \frac{1}{n_0} + d(1, a) \ll \frac{1}{n_0}.$$

By repeated application of right-invariance and the triangle inequality we have

$$d(x', x'g^m) \leq m\, d(x', x'g)$$

for any nonstandard natural number $m$, and in particular there exists $m_0 \gg n_0$ such that

$$d(x', x'g^m) < 1$$

for all $1 \leq m \leq m_0$. By (3.6), this means that $Bx'$ and $Bx'g^m$ intersect for all such $m$, thus

$$g^m \in (x')^{-1} B^{-1} B x'$$

for all $1 \leq m \leq m_0$. By (3.8), the set $(x')^{-1} B^{-1} B x'$ can be covered by boundedly many left-translates of $HP$, and so the orbit $\{ g^m : 1 \leq m \leq m_0 \}$ is covered by a bounded number of left translates of $HP$. Each of these translates either lies in $\langle HP \rangle$ or is disjoint from that group. On the other hand, $g$ lies in $\langle HP \rangle$. Thus (by induction on $m$) the only left translates of $HP$ that the orbit $\{ g^m : 1 \leq m \leq m_0 \}$ can actually reach are also contained in $\langle HP \rangle$, so that

$$g^m \in HP^{O(1)}$$

for all $1 \leq m \leq m_0$. We can then write

$$g^m \in u_1^{n_{1,m}} \dots u_r^{n_{r,m}} H$$

for some $n_{i,m} = O(N_i)$ for $i = 1, \dots, r$ and $1 \leq m \leq m_0$. From Proposition 3.1 (and the upper triangular form (i)) we conclude that $n_{1,m}$ is linear in $m$, $n_{1,m} = m n_{1,1}$, which implies that

$$n_{1,1} \ll \frac{N_1}{m_0} \ll \frac{N_1}{n_0}.$$

A second application of Proposition 3.1 (and the upper triangular form) then shows that $n_{2,m}$ is linear in $m$ up to additive errors of $O(mN_2/n_0)$, which implies that $n_{2,1} = O(N_2/n_0)$. Continuing in this fashion we see that $n_{i,1} = O(N_i/n_0)$ for all $i$, which implies that $\|g\|_{HP} \ll 1/n_0$, thus

$$\| \sigma_a(x)^{-1} a x \|_{HP} \ll \frac{1}{n_0}$$

for all $x \in X$, and thus by (1.1) we have

$$\|a\|_{HP,X} \ll \frac{1}{n_0} \tag{3.10}$$

for all $a \in A$. If we had $n_0 \sim n$ then we would now have Theorem 2.1; however we currently only have $n_0 \ll n$. We can address this issue as follows. Firstly, if we set $A' := A \cup \{1\} \cup A^{-1}$, then from (3.10) we have

$$\|a\|_{HP,X} \ll \frac{1}{n_0}$$

for all $a \in A'$; also

$$HP \subset (A^{n_0} \cup \{1\} \cup A^{-n_0})^3 \subset (A')^{3n_0}.$$

By repeating the arguments used to establish (1.4) we see that

$$HP^m \subset (A')^{Cmn_0} \subset X HP^{C^2 m} \tag{3.11}$$

for some standard $C$ and all nonstandard natural numbers $m$. By [9, (3.2), Proposition 3.10] we have $|HP^m| \sim |HP^{m'}|$ whenever $m \sim m'$, and thus $|(A')^{Cmn_0}| \sim |(A')^{Cm'n_0}|$ whenever $m \sim m'$. In particular, we have $|(A')^{100n}| \sim |(A')^n|$ for any $n \geq n_0$. If we now repeat the entire argument in this section, with $A$ replaced by $A'$ and $n_0$ replaced by $n$, until we reach (3.10) again (and with $HP$ replaced by some other ultra coset nilprogression), we obtain Theorem 2.1 as required.


4. Further growth of locally polynomial functions 

We now establish Theorem 1.9. Arguing as in Section 2, we can derive Theorem 1.9 from the following nonstandard version:

Theorem 4.1 (Further growth of a locally polynomially growing set, nonstandard version). Let $A$ be a symmetric internal finite subset of a nonstandard group $G$ containing the identity, let $n$ be an unbounded natural number, and suppose that $|A^n| \leq n^{O(1)}|A|$. Then there exists a continuous piecewise linear non-decreasing internal function $f : {}^*[0, +\infty) \to {}^*[0, +\infty)$, with $f(0) = 0$ and $f$ having a bounded number of distinct linear pieces, each of which has a slope that is a standard natural number, such that

$$\log |A^{mn}| = \log |A^n| + f(\log m) + O(1)$$

for all nonstandard natural numbers $m \geq 1$.


Let $A$ and $n$ be as in the above theorem. By the arguments in the previous section, $A^n$ is commensurable with an ultra coset nilprogression $HP$ in normal form, of some standard rank $r$, which obeys the N-properness property in Proposition 3.1. We will induct on this parameter $r$, assuming that the claim has already been proven for smaller values of $r$. From (1.4) and Theorem 1.5(ii) we have

$$|A^{mn}| \sim |HP^m| \sim |H| |P^m|$$

where $P := HP/H$, which lies in the internal group $N(H)/H$, where $N(H)$ is the normaliser of $H$ in $G$. Thus matters reduce to showing that

$$\log |P^m| = \log |P| + f(\log m) + O(1) \tag{4.1}$$

for $f$ as above.

By construction, $P = P(u_1, \dots, u_r; N_1, \dots, N_r)$ is an N-proper ultra nilprogression in normal form. We may assume that $N_i \gg 1$ for each $i = 1, \dots, r$, since any $i$ with $N_i = O(1)$ can simply be deleted from the progression, at which point we can use the induction hypothesis. It will be convenient to lift this progression up to a nonstandard simply connected Lie group $L$ (cf. [6, Lemma C.3] for a similar lifting from a local group nilprogression to a global group nilprogression):

Proposition 4.2. There exists a nonstandard simply connected nilpotent Lie group $L$ of dimension $r$ (that is to say, the ultraproduct of standard simply connected nilpotent Lie groups of dimension $r$), a linear basis (over ${}^*\mathbb{R}$) $X_1, \dots, X_r$ of the associated Lie algebra $\log L$ (which can be identified with $L$ using the exponential map $\exp : \log L \to L$ and its inverse, the logarithmic map $\log : L \to \log L$), and a homomorphism $\phi : \Gamma \to N(H)/H$ from the nonstandard group $\Gamma \leq L$ generated by $\exp(X_1), \dots, \exp(X_r)$ to $N(H)/H$ such that $\phi(\exp(X_i)) = u_i$ for all $i = 1, \dots, r$. Furthermore, we have

$$[X_i, X_j] = \sum_{i,j < k \leq r} c_{ijk} X_k \tag{4.2}$$

for all $1 \leq i, j \leq r$ and some nonstandard rationals $c_{ijk}$, such that $q c_{ijk}$ is a nonstandard integer for some bounded positive integer $q$, and such that $c_{ijk} = O\left( \frac{N_k}{N_i N_j} \right)$, where $[,]$ of course denotes the Lie bracket on the Lie algebra $\log L$. Finally, for any standard $C > 0$, $\phi$ is injective on $P(\exp(X_1), \dots, \exp(X_r); CN_1, \dots, CN_r)$.

Proof. We induct on $r$. The case $r = 0$ is vacuously true. We could use this as the base case, but the $r = 1$ case is also easily verified directly, by setting $L = \log L := {}^*\mathbb{R}$ (with $L$ expressed using additive notation), $X_1 := 1$, and $\phi(n) := u_1^n$ for any nonstandard integer $n$.

Now suppose inductively that $r > 1$ and that the claim has already been proven for $r - 1$. In particular, there exists a nonstandard simply connected nilpotent Lie group $L_2$ of dimension $r - 1$, a linear basis $X_2, \dots, X_r$ of $\log L_2$ over ${}^*\mathbb{R}$, and a homomorphism $\phi_2 : \Gamma_2 \to N(H)/H$ from the nonstandard group $\Gamma_2 \leq L_2$ generated by $\exp(X_2), \dots, \exp(X_r)$ to $N(H)/H$ such that $\phi_2(\exp(X_i)) = u_i$ for $i = 2, \dots, r$, and such that (4.2) holds for $2 \leq i, j \leq r$ and structure constants $c_{ijk}$ with the stated properties, and with $\phi_2$ injective on $P(\exp(X_2), \dots, \exp(X_r); CN_2, \dots, CN_r)$ for any standard $C$.

From the normal form hypothesis on $P$, we have

$$[u_i, u_j] = u_{j+1}^{n_{i,j,j+1}} \dots u_r^{n_{i,j,r}}$$

for $1 \leq i < j \leq r$ and some nonstandard integers $n_{i,j,k} = O\left( \frac{N_k}{N_i N_j} \right)$. From the injectivity of $\phi_2$, we conclude that

$$[\exp(X_i), \exp(X_j)] = \exp(X_{j+1})^{n_{i,j,j+1}} \dots \exp(X_r)^{n_{i,j,r}}$$

for $2 \leq i < j \leq r$. It is clear that these relations on the generators $\exp(X_2), \dots, \exp(X_r)$ define $\Gamma_2$ as a group.

Now, the element $u_1$ acts on $N(H)/H$ by conjugation $\eta : g \mapsto u_1^{-1} g u_1 = g [u_1, g]^{-1}$. In particular,

$$[\eta(u_i), \eta(u_j)] = \eta(u_{j+1})^{n_{i,j,j+1}} \dots \eta(u_r)^{n_{i,j,r}}$$

for $2 \leq i < j \leq r$. Observe that $\eta(u_i) = \phi_2(\exp(X'_i))$ for $2 \leq i \leq r$, where

$$X'_i := \log\left( \exp(X_i) \exp(X_r)^{-n_{1,i,r}} \dots \exp(X_{i+1})^{-n_{1,i,i+1}} \right).$$

All of the terms in the above identity then lie in $P(\exp(X_2), \dots, \exp(X_r); CN_2, \dots, CN_r)$ for some standard $C$, so by injectivity of $\phi_2$ we conclude that

$$[\exp(X'_i), \exp(X'_j)] = \exp(X'_{j+1})^{n_{i,j,j+1}} \dots \exp(X'_r)^{n_{i,j,r}}$$

for $2 \leq i < j \leq r$. Thus there exists a nonstandard group homomorphism $\eta : \Gamma_2 \to \Gamma_2$ that maps $\exp(X_i)$ to $\exp(X'_i)$ for $2 \leq i \leq r$. Applying the same considerations to the inverse conjugation $\eta^{-1}$ we see that $\eta$ is invertible, and is thus a nonstandard group automorphism on $\Gamma_2$.

We would like to extend this automorphism from the nonstandard discrete group $\Gamma_2$ to the nonstandard Lie group $L_2$. We first work in the intermediate nonstandard group $\Gamma_2({}^*\mathbb{Q})$, defined as the set of all elements $g \in L_2$ such that $g^n \in \Gamma_2$ for some nonstandard positive integer $n$. From the Baker-Campbell-Hausdorff formula (which has only finitely many terms in the nilpotent group $L_2$) we see that this is a nonstandard subgroup of $L_2$ that contains $\Gamma_2$. We define the extension $\tilde\eta : \Gamma_2({}^*\mathbb{Q}) \to \Gamma_2({}^*\mathbb{Q})$ by setting $\tilde\eta(g) := \eta(g^n)^{1/n}$ for any $g \in \Gamma_2({}^*\mathbb{Q})$ and any nonstandard positive integer $n$ with $g^n \in \Gamma_2$, where we write $g^t := \exp(t \log g)$ for $g \in L_2$ and $t \in {}^*\mathbb{R}$. It is easy to see that this extension is well-defined, and from the Baker-Campbell-Hausdorff formula one can verify that it is a nonstandard group homomorphism; applying the same considerations to the inverse of $\tilde\eta$ we see that $\tilde\eta$ is in fact a nonstandard group automorphism on $\Gamma_2({}^*\mathbb{Q})$.
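The fractional powers $g^t = \exp(t \log g)$ used here can be computed exactly in the simplest non-abelian example. The sketch below is an illustration under assumptions rather than the paper's construction: it works in the standard three-dimensional Heisenberg group with rational entries, and the coordinate convention and function names are hypothetical.

```python
from fractions import Fraction

def mul(g, h):
    """Product in the Heisenberg group, where (a, b, c) stands for the
    upper unitriangular matrix [[1, a, c], [0, 1, b], [0, 0, 1]]."""
    a, b, c = g
    a2, b2, c2 = h
    return (a + a2, b + b2, c + c2 + a * b2)

def log(g):
    """Matrix logarithm N - N^2/2 with N = g - I, in Lie algebra coordinates."""
    a, b, c = g
    return (a, b, c - Fraction(1, 2) * a * b)

def exp(v):
    """Matrix exponential I + X + X^2/2 of a strictly upper triangular X."""
    x, y, z = v
    return (x, y, z + Fraction(1, 2) * x * y)

def power(g, t):
    """Fractional power g^t := exp(t log g) for a rational exponent t."""
    x, y, z = log(g)
    return exp((t * x, t * y, t * z))
```

For example, `power((1, 1, 0), Fraction(1, 2))` returns an element whose square under `mul` is `(1, 1, 0)` again, and `power(g, 2)` agrees with `mul(g, g)`. The series terminate after the quadratic term because the group is 2-step nilpotent, which is the finite analogue of the remark that the Baker-Campbell-Hausdorff formula has only finitely many terms.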

The group $\Gamma_2({}^*\mathbb{Q})$ is an internally dense subgroup of $L_2$, and $\tilde\eta$ is internally locally uniformly continuous, so $\tilde\eta$ extends uniquely to a nonstandard continuous group homomorphism on $L_2$, which on consideration of the inverse is in fact a nonstandard continuous group automorphism on $L_2$. As all continuous homomorphisms between Lie groups are smooth, $\tilde\eta$ is in fact a nonstandard smooth group automorphism on $L_2$. It induces a corresponding nonstandard Lie algebra automorphism $\log \tilde\eta : \log L_2 \to \log L_2$. By the Baker-Campbell-Hausdorff formula, we see that $\log \tilde\eta(X_i) - X_i$ lies in the linear span of $X_{i+1}, \dots, X_r$ over ${}^*\mathbb{R}$ for $2 \leq i \leq r$.

The space of nonstandard Lie algebra automorphisms $\Phi : \log L_2 \to \log L_2$ with the property that $\Phi(X_i) - X_i$ lies in the linear span of $X_{i+1}, \dots, X_r$ over ${}^*\mathbb{R}$ for $2 \leq i \leq r$ can be verified to be a nonstandard simply connected nilpotent Lie group. In particular, we can define an internal one-parameter group $(\log \tilde\eta)^t : \log L_2 \to \log L_2$ in this group for $t \in {}^*\mathbb{R}$ that depend in an internally continuous fashion on $t$, which in turn defines an internal one-parameter group of nonstandard Lie group automorphisms $\tilde\eta^t : L_2 \to L_2$ that also depend internally continuously on $t$. This lets us define an $r$-dimensional nonstandard Lie group $L := {}^*\mathbb{R} \ltimes L_2$, which extends $L_2$ by an internal one-parameter group $\{ \exp(tX_1) : t \in {}^*\mathbb{R} \}$ such that $\exp(-tX_1) g \exp(tX_1) = \tilde\eta^t(g)$ for all $t \in {}^*\mathbb{R}$ and $g \in L_2$. The map $g \mapsto g^{-1} \tilde\eta^t(g)$ is nilpotent from the structure of $\log \tilde\eta$ and the Baker-Campbell-Hausdorff formula, so $L$ is nilpotent; as it is internally homeomorphic to ${}^*\mathbb{R}^r$, it is also internally simply connected.

From construction, we have

$$\exp(-X_1) \exp(X_j) \exp(X_1) = \exp(X'_j)$$

for $2 \leq j \leq r$, which gives (4.2) for the remaining case $i = 1$ from the Baker-Campbell-Hausdorff formula and a downward induction on $j$. This identity also yields

$$\phi_2(\exp(-X_1) \exp(X_j) \exp(X_1)) = u_1^{-1} \phi_2(\exp(X_j)) u_1.$$

Since the $\exp(X_j)$ for $2 \leq j \leq r$ internally generate $\Gamma_2$, we conclude that

$$\phi_2(\exp(-X_1) g \exp(X_1)) = u_1^{-1} \phi_2(g) u_1$$

and thus

$$\phi_2(\exp(-nX_1) g \exp(nX_1)) = u_1^{-n} \phi_2(g) u_1^{n}$$

for all $g \in \Gamma_2$ and $n \in {}^*\mathbb{Z}$. If we then define $\Gamma := {}^*\mathbb{Z} \ltimes \Gamma_2$ to be the internally discrete subgroup of $L$ generated by $\exp(X_1)$ and $\Gamma_2$, we may thus extend the internal homomorphism $\phi_2 : \Gamma_2 \to N(H)/H$ to an internal homomorphism $\phi : \Gamma \to N(H)/H$ such that $\phi(\exp(X_1)) = u_1$.

Finally, we have to demonstrate the injectivity of $\phi$ on $P(\exp(X_1), \dots, \exp(X_r); O(N_1), \dots, O(N_r))$. Suppose for contradiction that injectivity failed. Gathering terms, we obtain a collision of the form

$$\phi(\exp(n_1 X_1) \dots \exp(n_r X_r)) = \phi(\exp(n'_1 X_1) \dots \exp(n'_r X_r))$$

for some $n_i, n'_i = O(N_i)$ with $(n_1, \dots, n_r) \neq (n'_1, \dots, n'_r)$, and thus

$$u_1^{n_1} \dots u_r^{n_r} = u_1^{n'_1} \dots u_r^{n'_r}.$$

But this contradicts the N-properness of $P(u_1, \dots, u_r; N_1, \dots, N_r)$. □

Remark 4.3. One could also construct the Lie group L here using the theory of Mal’cev 
bases [LB]. 


Let $L, \phi, X_1, \dots, X_r$ be as in the above proposition. The set $P^m$ can now be expressed as

$$P^m = \phi(Q^m) \tag{4.3}$$

where $Q$ is the nilprogression

$$Q := P(\exp(X_1), \dots, \exp(X_r); N_1, \dots, N_r).$$

It is thus natural to begin analysing the geometry of $Q^m$. To do this, we perform some
calculations related to those in ®, m (see also the analysis of Carnot-Caratheodory 
balls in HU, H for some analogous calculations). Define a formal commutator word 
to be any string generated by the following rules:

• For any $i = 1, \dots, r$, $i$ and $i^{-1}$ are formal commutator words.

• If $w_1, w_2$ are formal commutator words, then the strings $[w_1, w_2]$ and $[w_1, w_2]^{-1}$ are formal commutator words.

Thus for instance $[[1, 2^{-1}]^{-1}, [1^{-1}, 3]]^{-1}$ will be a formal commutator word if $r \geq 3$. Define the length $|w|$ of a formal commutator word $w$ by requiring $i, i^{-1}$ to have length 1 for $i = 1, \dots, r$, and $[w_1, w_2]$, $[w_1, w_2]^{-1}$ to have length $|w_1| + |w_2|$ for any formal commutator words $w_1, w_2$. Thus for instance $[[1, 2^{-1}]^{-1}, [1^{-1}, 3]]^{-1}$ has length 4.

Given a formal commutator word $w$, we define the element $X_w$ of $\log L$ as follows.

• For any $i = 1, \dots, r$, we keep $X_i$ as before, and write $X_{i^{-1}} := -X_i$.

• If $w_1, w_2$ are formal commutator words, then $X_{[w_1, w_2]} := \log([\exp(X_{w_1}), \exp(X_{w_2})])$ and $X_{[w_1, w_2]^{-1}} := -X_{[w_1, w_2]}$.

Thus for instance $X_{[[1, 2^{-1}]^{-1}, [1^{-1}, 3]]^{-1}} = \log [[\exp(X_1), \exp(-X_2)]^{-1}, [\exp(-X_1), \exp(X_3)]]^{-1}$. Finally, we write $N$ for the vector $N := (N_1, \dots, N_r)$, and define $N^w$ for any formal commutator word by the following rules:

• For any $i = 1, \dots, r$, we write $N^{i} = N^{i^{-1}} := N_i$.

• If $w_1, w_2$ are formal commutator words, then $N^{[w_1, w_2]} = N^{[w_1, w_2]^{-1}} := N^{w_1} N^{w_2}$.

Thus for instance $N^{[[1, 2^{-1}]^{-1}, [1^{-1}, 3]]^{-1}} = N_1^2 N_2 N_3$, and for any scalar $m$, $(mN)^{[[1, 2^{-1}]^{-1}, [1^{-1}, 3]]^{-1}} = m^4 N_1^2 N_2 N_3$. Since $N_i \gg 1$ for all $i$, we have $N^w \gg 1$ for all $w$. Note that $(mN)^w = m^{|w|} N^w$ for any scalar $m$.
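The recursive definitions of the length $|w|$ and the weight $N^w$ translate directly into code. In the sketch below the encoding is an assumption made for illustration: a generator $i$ or $i^{-1}$ is stored as the integer `i` or `-i`, and a commutator $[w_1, w_2]$ or $[w_1, w_2]^{-1}$ as a triple `(w1, w2, inverted)`.

```python
def length(w):
    """Length |w|: generators have length 1, commutators add the lengths of their parts."""
    if isinstance(w, int):
        return 1
    w1, w2, _inverted = w
    return length(w1) + length(w2)

def weight(w, N):
    """Weight N^w for N = (N_1, ..., N_r): the product of N_i over all generators in w."""
    if isinstance(w, int):
        return N[abs(w) - 1]
    w1, w2, _inverted = w
    return weight(w1, N) * weight(w2, N)

# The example word [[1, 2^{-1}]^{-1}, [1^{-1}, 3]]^{-1} from the text.
example = ((1, -2, True), (-1, 3, False), True)
```

With `N = (2, 3, 5)` the weight of `example` is $N_1^2 N_2 N_3 = 60$, and rescaling every $N_i$ by a factor $m$ multiplies the weight by $m^4$, in line with the identity $(mN)^w = m^{|w|} N^w$.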

Since $[X_i, X_j]$ lies in the (nonstandard) span of $X_k$ for $k > i, j$, we see that $X_w = 0$ for all but a bounded number of words $w$ (for instance, $X_w = 0$ whenever $|w| > r$). Let $W$ be the collection of formal commutator words for which $X_w \neq 0$. We enumerate $W = w_1, w_2, \dots, w_k$ in non-decreasing order of length, starting with the generating words $1, \dots, r$, so in particular $[w_i, w_j]$ lies further along $W$ in this enumeration than $w_i$ or $w_j$, or else fails to lie in $W$ at all. From the Baker-Campbell-Hausdorff formula we see that for any $1 \leq i < j \leq k$, $[X_{w_i}, X_{w_j}]$ is a linear combination (over the standard rationals $\mathbb{Q}$) of the $X_{w_l}$ for $j < l \leq k$.

We have the following rough description of $P(\exp(X_1), \dots, \exp(X_r); mN_1, \dots, mN_r)$ from [25]:

Proposition 4.4. Let $m$ be a nonstandard natural number.

(i) One has

$$\exp(n_1 X_{w_1}) \dots \exp(n_k X_{w_k}) \in Q^{O(m)}$$

whenever $n_1, \dots, n_k$ are nonstandard integers with $n_j = O((mN)^{w_j})$ for $j = 1, \dots, k$.

(ii) Conversely, every element $g$ of $Q^m$ can be written in the form

$$g = \exp(n_1 X_{w_1}) \dots \exp(n_k X_{w_k})$$

where $n_1, \dots, n_k$ are nonstandard integers with $n_j = O((mN)^{w_j})$ for $j = 1, \dots, k$.


Proof. This follows from [28, Proposition C.1]. (Strictly speaking, that proposition as written restricts the words under consideration to be basic commutator words, but as observed in [9], the argument extends without difficulty to arbitrary words.) The claim can also be established from [9, (3.2), Proposition 3.10]. □


From Proposition 4.4 and the Baker-Campbell-Hausdorff formula we also see that every element $g$ of $Q^m$ can be written in the form

$$g = \exp(a_1 X_{w_1} + \dots + a_k X_{w_k})$$

where $a_1, \dots, a_k$ are nonstandard rationals with standard denominator and $a_j = O((mN)^{w_j})$ for $j = 1, \dots, k$. A further application of the Baker-Campbell-Hausdorff formula gives

$$X_{w_i} = \sum_{j=1}^{r} \alpha_{ij} X_j \tag{4.4}$$

for some coefficients $\alpha_{ij} = O(N_j / N^{w_i})$ that are nonstandard rationals with standard denominator. We can therefore write

$$g = \exp\left( \sum_{j=1}^{r} b_j X_j \right) \tag{4.5}$$

with $b_j$ nonstandard rationals with standard denominator, and the vector $b := (b_1, \dots, b_r) \in {}^*\mathbb{R}^r$ lying in the (internal) convex hull $B_m$ of $\pm C (mN)^{w_i} \alpha_i$ for $i = 1, \dots, k$, where $C$ is a sufficiently large standard quantity and $\alpha_i \in {}^*\mathbb{R}^r$ is the vector $\alpha_i = (\alpha_{i1}, \dots, \alpha_{ir})$. Since $\alpha_1, \dots, \alpha_r$ is just the canonical basis for ${}^*\mathbb{R}^r$, this convex hull $B_m$ contains the unit cube (if $C$ is large enough), and so from standard volume packing arguments we see that

$$|Q^m| \ll \mathrm{vol}(B_m).$$

We also have a matching lower bound: 

Proposition 4.5. For any nonstandard positive integer $m$, we have

$$|Q^m| \gg \mathrm{vol}(B_m).$$


Proof. Since $B_m$ and $B_{Cm}$ have comparable volume for any standard $C > 0$, it will suffice to show that $|Q^{C_0 m}| \gg \mathrm{vol}(B_m)$ for some standard $C_0$.

By volume packing, we know that $B_m$ contains $\gg \mathrm{vol}(B_m)$ lattice points in ${}^*\mathbb{Z}^r$ (if $C$ is large enough). Let $(x_1, \dots, x_r)$ be one of these lattice points, and form the Lie algebra vector

$$v = \sum_{i=1}^{r} x_i X_i = \sum_{i=1}^{k} x_i X_{w_i}$$

with $x_i = O((mN)^{w_i})$ nonstandard real for $i = 1, \dots, r$ (in fact they are nonstandard integer), and $x_{r+1} = \dots = x_k = 0$. Our strategy will be to factorise

$$\exp(v) = \exp(x_1 X_{w_1} + \dots + x_k X_{w_k})$$

as an element of $Q^{O(m)}$ times a "bounded" error. By rounding $x_1$ to the nearest integer $n_1$ and then using the Baker-Campbell-Hausdorff formula, we may write

$$\exp(v) = \exp(n_1 X_{w_1}) \exp(x'_2 X_{w_2} + \dots + x'_k X_{w_k}) \exp(t_1 X_{w_1})$$

where $t_1 = O(1)$ is a nonstandard real, $n_1$ is a nonstandard integer with $n_1 = O((mN)^{w_1})$, and $x'_j = O((mN)^{w_j})$ are nonstandard reals for $j = 2, \dots, k$. Iterating this procedure, we can obtain a factorisation of the form

$$\exp(v) = \exp(n_1 X_{w_1}) \dots \exp(n_k X_{w_k}) \exp(t_k X_{w_k}) \dots \exp(t_1 X_{w_1})$$

where $n_i = O((mN)^{w_i})$ are nonstandard integers and $t_i = O(1)$ are nonstandard reals for $i = 1, \dots, k$. In particular, by Proposition 4.4(i) we have

$$\exp(v) \in Q^{O(m)} \exp(t_k X_{w_k}) \dots \exp(t_1 X_{w_1}).$$

Using (4.4) and the Baker-Campbell-Hausdorff formula (discarding the $N^{w_i}$ denominator in the bounds on $\alpha_{ij}$) we thus have

$$\exp(v) \in Q^{O(m)} \exp(y_1 X_1 + \dots + y_r X_r)$$

where $y_1, \dots, y_r$ are nonstandard reals with $y_i = O(N_i)$ for $i = 1, \dots, r$. Repeating the previous factorisation procedure, we then have

$$\exp(v) \in Q^{O(m)} \exp(n'_1 X_1) \dots \exp(n'_r X_r) \exp(t'_r X_r) \dots \exp(t'_1 X_1)$$

where $n'_i = O(N_i)$ is a nonstandard integer and $t'_i = O(1)$ is a nonstandard real for each $i = 1, \dots, r$. The product $\exp(n'_1 X_1) \dots \exp(n'_r X_r)$ lies in $Q^{O(1)}$, hence

$$\exp(v) \in Q^{O(m)} \exp(t'_r X_r) \dots \exp(t'_1 X_1)$$

or equivalently

$$\exp(t'_r X_r) \dots \exp(t'_1 X_1) \in Q^{O(m)} \exp(v).$$

By construction, $v$ is a linear combination of $X_1, \dots, X_r$ with coefficients that are nonstandard rational with standard bounded denominator. By the Baker-Campbell-Hausdorff formula, the same is therefore true of $\log(\exp(t'_r X_r) \dots \exp(t'_1 X_1))$, which by further application of the Baker-Campbell-Hausdorff formula and induction on $i$ shows that each $t'_i$ is also a nonstandard rational with standard denominator. Since $t'_i = O(1)$, we conclude that there are only a bounded number of possibilities for each $t'_i$. We conclude that

$$\exp(v) \in Q^{O(m)} X$$

for some set $X \subset L$ of bounded cardinality that is independent of $v$. Since there are $\gg \mathrm{vol}(B_m)$ possibilities for $\exp(v)$, we thus have $|Q^{C_0 m}| \gg \mathrm{vol}(B_m)$ for some sufficiently large $C_0$, as required. □


From Cramer's rule (and selecting the $r$-tuple of vectors of the form $\pm (mN)^{w_i} \alpha_i$ that have the largest wedge product), every element of $B_m$ can be written as a linear combination of $r$ vectors of the form $(mN)^{w_i} \alpha_i$ with nonstandard real coefficients of size $O(1)$. Conversely, any such linear combination will lie in $B_m$ if the coefficients are smaller than some sufficiently small standard $\varepsilon > 0$. From this we see that

$$\mathrm{vol}(B_m) \sim \sum_{1 \leq i_1 < \dots < i_r \leq k} \left| (mN)^{w_{i_1}} \alpha_{i_1} \wedge \dots \wedge (mN)^{w_{i_r}} \alpha_{i_r} \right|.$$

The right-hand side is a polynomial $V(m)$ in $m$ of bounded degree and non-negative coefficients. From the preceding bounds on $Q^m$ we thus have

$$|Q^m| \sim V(m).$$

Using $x + y \sim \max(x, y)$ and the polynomial nature of $V$, we see that $\log V(m)$ is asymptotic to a piecewise linear continuous function of $\log m$, with boundedly many pieces and all slopes non-negative bounded integers. This gives an estimate of the form

$$\log |Q^m| = \log |Q| + f(\log m) + O(1) \tag{4.6}$$

with $f$ of the required form (and with the additional property of being convex). By (4.3), this establishes the claim in the "${}^*\mathbb{N}$-proper" case where $\phi : \Gamma \to N(H)/H$ is injective.
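The passage from the polynomial $V(m)$ to a piecewise linear function of $\log m$ can be seen numerically. The sketch below is illustrative only, with hypothetical function names and arbitrarily chosen coefficients: it compares $\log V(m)$ with the maximum of the linear functions $\log c_j + j \log m$, using the fact that a sum of $k$ non-negative terms lies between the largest term and $k$ times the largest term.

```python
import math

def log_poly(coeffs, m):
    """log V(m) for V(m) = sum_j coeffs[j] * m**j with non-negative coefficients."""
    return math.log(sum(c * m**j for j, c in enumerate(coeffs)))

def piecewise_linear(coeffs, t):
    """max_j (log c_j + j * t): a convex piecewise linear function of t = log m
    whose slopes are the degrees j of the monomials that dominate in turn."""
    return max(math.log(c) + j * t for j, c in enumerate(coeffs) if c > 0)

def gap(coeffs, m):
    """Difference log V(m) - max_j (log c_j + j log m); it lies in [0, log k]."""
    return log_poly(coeffs, m) - piecewise_linear(coeffs, math.log(m))
```

With `coeffs = [1000, 0, 1, 0.001]`, for example, the constant term dominates for small $m$, then the quadratic term, then the cubic one, so the slopes 0, 2, 3 appear in increasing order. This matches the convexity noted above, and the gap never exceeds $\log 3$ because there are three non-zero terms.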

Now suppose we are in the remaining case when / is non-injective. Then (by the 
nonstandard well-ordering principle), there exists a nonstandard natural number mo 
such that (j) is injective on Q m if and only if m < mo; by Proposition 14.21 we see that mo 
is unbounded. From (14.6)1 we see that the desired estimate (14.11) is already established 
for m < mo; actually from (14.61) and the monotonicity of log |P | in m, it is established 
for m = O(mo). It remains to handle the case when m ^ mo- 
Since the homomorphism $\phi$ is not injective on $Q^{m_0}$, there exists a non-identity element $g$ of $Q^{O(m_0)}$ such that $\phi(g) = 1$. If $g$ does not commute with every generator $\exp(X_i)$ of $Q$, then we may replace $g$ with the commutator $[g, \exp(X_i)]$ for some $i = 1,\dots,r$, which still lies in $Q^{O(m_0)}$ (with a slightly larger implied constant). Repeating this procedure a bounded number of times, we may assume without loss of generality that $g$ commutes with every $\exp(X_i)$ and thus with $\Gamma$; since $\Gamma$ is internally cocompact in $L$ and the group operations are polynomial, this implies that $g$ is a central element of $L$.

Next, we observe from Theorem 1.5(ii) (or (4.6)) that $|P^m| \sim |P^{m_0}|$ for all $m \sim m_0$, which by [25, Corollary 3.11] shows that the $P^m$ are ultra approximate groups for all $m \sim m_0$. As in the proof of Proposition 3.1, we can then find an ultra approximate group $A \subset P^{O(m_0)}$ commensurate with $P^{m_0}$ with a global model $\psi : \langle A \rangle \to L'$ to a connected, simply connected nilpotent Lie group $L'$.

We have a key dimension reduction estimate: 

Proposition 4.6. The dimension of L' is strictly smaller than r. 


Proof. Recall we have the internally convex body $B_{m_0}$ in ${}^*\mathbb{R}^r$. This generates two external subspaces (over $\mathbb{R}$) of ${}^*\mathbb{R}^r$: the space $O(B_{m_0})$ consisting of all vectors $v$ of the form $\lambda w$ for some $\lambda \in \mathbb{R}$ and $w \in B_{m_0}$, and the subspace $o(B_{m_0})$ consisting of all vectors $v \in {}^*\mathbb{R}^r$ such that $\lambda v \in B_{m_0}$ for all $\lambda \in \mathbb{R}$. By John’s theorem, we see that $o(B_{m_0})$ is an (external) subspace of $O(B_{m_0})$, with a quotient $O(B_{m_0})/o(B_{m_0})$ that has dimension exactly $r$ (over $\mathbb{R}$). Since $m_0$ is unbounded, $o(B_{m_0})$ contains $\mathbb{R}^r$, and so in particular $O(B_{m_0})/o(B_{m_0})$ is also the image of $O(B_{m_0}) \cap {}^*\mathbb{Z}^r$ under the projection map coming from quotienting by $o(B_{m_0})$.

Pulling back the Lie algebra structure on $\log L$ under the map $\xi : (v_1,\dots,v_r) \mapsto v_1 X_1 + \dots + v_r X_r$, we obtain a nonstandard nilpotent Lie algebra structure on ${}^*\mathbb{R}^r$. From the definition of $B_{m_0}$, we see that this Lie bracket preserves $O(B_{m_0})$, with $o(B_{m_0})$ as an external Lie algebra ideal (over $\mathbb{R}$). Thus the vector space $O(B_{m_0})/o(B_{m_0})$ acquires the structure of an $r$-dimensional nilpotent Lie algebra (over $\mathbb{R}$). Exponentiating this, we obtain an $r$-dimensional nilpotent Lie group $L_0 := \exp(O(B_{m_0})/o(B_{m_0}))$ (over $\mathbb{R}$).

Similarly, $\xi(O(B_{m_0}))$ is an (external) subspace (over $\mathbb{R}$) of $\log L$ that is closed under Lie bracket, with $\xi(o(B_{m_0}))$ an external Lie algebra ideal, so $\exp(\xi(o(B_{m_0})))$ is a normal subgroup of $\exp(\xi(O(B_{m_0})))$ in $L$, and as $\xi$ is a Lie algebra isomorphism, we see that the quotient group $\exp(\xi(O(B_{m_0})))/\exp(\xi(o(B_{m_0})))$ may be identified with $L_0$ and is thus also an $r$-dimensional nilpotent Lie group over $\mathbb{R}$.

Since $\exp(t_i X_i)$ lies in $\exp(\xi(o(B_{m_0})))$ for any $i = 1,\dots,r$ and bounded $t_i$, the same rounding argument used to prove Proposition 4.5 shows that any element of $\exp(\xi(O(B_{m_0})))$ may be factored as an element of $\langle Q^{m_0} \rangle$ and an element of $\exp(\xi(o(B_{m_0})))$. In particular, this implies that the image of $Q^{m_0} \cap \exp(\xi(O(B_{m_0})))$ in $L_0$ is a compact neighbourhood of the identity.
Recall that $A \subset P^{Cm_0} = \phi(Q^{Cm_0})$ for some standard natural number $C$. The set $Q^{Cm_0} \cap \phi^{-1}(A)$, when projected onto $L_0$, is then a bounded symmetric set $E$, and from countable saturation it is closed. Since $A$ is commensurable with $P^{m_0} = \phi(Q^{m_0})$, we see that the image of $Q^{m_0}$ in $L_0$ can be covered by a bounded number of translates of $E$, so that $E$ has positive measure. By the Steinhaus lemma, this implies that $E^2$ contains a neighbourhood of the identity, and in particular generates $L_0$ as a group. From this we see that to every group element $h \in L_0$, we may find an element $\tilde h$ in the preimage of $h$ in $\langle Q^{m_0} \rangle$ such that $\phi(\tilde h) \in \langle A \rangle$. This element $\tilde h$ is defined up to an element of $\langle Q^{m_0} \rangle \cap \exp(\xi(o(B_{m_0})))$.

Suppose we have two such preimages $\tilde h, \tilde h'$; then $\tilde h' = \tilde h k$ for some $k \in \langle Q^{m_0} \rangle \cap \exp(\xi(o(B_{m_0})))$ with $\phi(k) \in \langle A \rangle$. For any standard natural number $n$, we have $k^n \in \langle Q^{m_0} \rangle \cap \exp(\xi(o(B_{m_0})))$, so by arguing as in the proof of Proposition 4.5 we see that $k^n \in Q^{Cm_0} X$ for some standard $C$ and some set $X$ of bounded cardinality (both independent of $n$), so that $\phi(k)^n \in P^{Cm_0} Y$ for some set $Y$ of bounded cardinality. As $A$ is commensurate with $P^{m_0}$, we conclude that $\phi(k)^n \in AZ$ for some set $Z$ of bounded cardinality. In particular, from the pigeonhole principle we see that $\phi(k)^n \in A^2$ for infinitely many $n$, and thus $\psi(\phi(k))^n \in \psi(A^2)$ for infinitely many $n$. But $\psi(A^2)$ is precompact and $L'$ is a simply connected nilpotent Lie group, which (as can be seen by taking logarithms in $L'$) forces $\psi(\phi(k)) = 1$ and hence $\psi(\phi(\tilde h)) = \psi(\phi(\tilde h'))$. We can thus define a map $\Phi : L_0 \to L'$ such that

$$\Phi(h) = \psi(\phi(\tilde h))$$

whenever $h \in L_0$ and $\tilde h \in \langle Q^{m_0} \rangle$ is in the preimage of $h$ with $\phi(\tilde h) \in \langle A \rangle$. From construction one can verify that $\Phi$ is a group homomorphism, and is continuous at the identity, and is thus a Lie group homomorphism. Since $\psi : \langle A \rangle \to L'$ is surjective, we see that $\Phi$ is surjective also.

As $L_0$ has dimension $r$, this already shows that $L'$ has dimension no larger than $r$. To show that $L'$ has dimension strictly less than $r$, it will suffice to show that $\Phi$ is not injective. To do this, suppose for contradiction that $\Phi$ is injective. Recall from construction of $m_0$ that $\phi$ is not injective on $Q^{m_0}$. Thus, there exists a non-identity element $g$ of $Q^{2m_0}$ that lies in the kernel of $\phi$. If $g'$ is the projection of $g$ to $L_0$, we thus see that $g'$ lies in the kernel of $\Phi$, and is thus the identity since we are assuming $\Phi$ to be injective. Arguing as in the proof of Proposition 4.5 we conclude that $g^n \in Q^{Cm_0} X$ for all standard $n$, some standard $C$, and some $X$ of bounded cardinality. Since $Q^{Cm_0}$ is commensurate to $Q^{m_0/4}$, we conclude from the pigeonhole principle that $g^n \in Q^{m_0/2}$ for some positive standard integer $n$. But $g^n$ is not the identity (as can be seen from taking logarithms) and in the kernel of $\phi$, thus $\phi$ is non-injective on $Q^{m_0/2}$, a contradiction. □


Using this global model $\psi$ in the proof of Proposition 3.1, we may thus find an $\mathbb{N}$-proper ultra coset nilprogression $H'P' \subset P^{O(m_0)}$ of rank strictly smaller than $r$ with $|H'P'| \sim |P^{m_0}|$. From (1.4) and the induction hypothesis, we have

$$\log |(H'P')^m| = \log |P^{m_0}| + f'(\log m) + O(1)$$

for all nonstandard natural numbers $m$ and some piecewise linear continuous function $f'$ with boundedly many pieces and all slopes non-negative bounded integers. By monotonicity of $\log |P^m|$ in $m$, this implies that

$$\log |P^m| = \log |P^{m_0}| + f'(\log m - \log m_0) + O(1)$$

for all $m \geq m_0$. Concatenating this with the already established case $m = O(m_0)$ of the estimate (4.1), we obtain (4.1) for all $m$, as required.


5. Inverse theorem for polynomial growth of measures 


In this section we prove Theorem 1.12.


Just as Theorem 1.7 follows from the nonstandard counterpart in Theorem 2.1, we may similarly obtain Theorem 1.12 from a nonstandard analogue. Define a nonstandard probability measure $\mu : G \to {}^*\mathbb{R}^+$ on a nonstandard group $G = \prod_{\mathbf{n} \to \alpha} G_{\mathbf{n}}$ to be an ultralimit of standard probability measures $\mu_{\mathbf{n}} : G_{\mathbf{n}} \to \mathbb{R}^+$. One can define the (nonstandard) convolution and $\ell^2$ norm of such a probability measure in the obvious fashion, as well as define what it means for a nonstandard probability measure to be symmetric. Given a nonstandard subset $E$ of $G$, the quantity $\mu(E)$ is then a nonstandard real number between $0$ and $1$, and given a nonstandard function $f : G \to {}^*[0,+\infty]$, the integral $\int_G f\, d\mu \in {}^*[0,+\infty]$ is a nonstandard element of $[0,+\infty]$.


By repeating the “compactness and contradiction” arguments used to derive Theorem 1.7 from Theorem 2.1, we may thus derive Theorem 1.12 from


Theorem 5.1 (Inverse theorem for polynomial growth, nonstandard formulation). Let $\mu$ be a symmetric nonstandard probability measure on a nonstandard group $G$, let $\varepsilon > 0$ be standard, and suppose that

$$\|\mu^{*n}\|_{\ell^2(G)}^{-2} \leq n^{O(1)} \|\mu\|_{\ell^2(G)}^{-2} \tag{5.1}$$

for some unbounded natural number $n$. Then there exists an ultra coset nilprogression $HP$, and a finite subset $X$ of $G$ of bounded cardinality containing the identity, such that

$$|HP| \ll \|\mu^{*n}\|_{\ell^2(G)}^{-2} \tag{5.2}$$

and such that

$$\int_{G \setminus E} \|g\|_{HP,X}^2\, d\mu(g) \ll \frac{1}{n} \tag{5.3}$$

for some exceptional set $E$ with

$$\mu(E) \ll n^{-1+\varepsilon}. \tag{5.4}$$


We now prove Theorem 5.1. Henceforth we abbreviate $\ell^2(G)$ as $\ell^2$. Let $B$ be a large standard quantity to be chosen later. Let $n'$ be the element of $[1,n]$ that maximises the quantity $\|\mu^{*n'}\|_{\ell^2} (n')^B$; then we have

$$\|\mu^{*n'}\|_{\ell^2} (n')^B \geq \|\mu^{*n}\|_{\ell^2} n^B \gg n^{B-O(1)} \|\mu\|_{\ell^2} \geq n^{B-O(1)} \|\mu^{*n'}\|_{\ell^2} \tag{5.5}$$

and thus

$$n^{1-O(1/B)} \ll n' \leq n$$

and in particular

$$n^{1-\varepsilon/2} = o(n')$$

if $B$ is large enough.
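The passage from (5.5) to the lower bound on $n'$ is a short computation. The following sketch spells it out, writing $C$ for the standard constant hidden in the $O(1)$ exponent of (5.5); the name $C$ is introduced here only for illustration, and it is assumed to have been enlarged if necessary to absorb the implied constant in $\gg$ (which is harmless since $n$ is unbounded).

```latex
% Cancel the common factor \|\mu^{*n'}\|_{\ell^2} from the two ends of (5.5):
\[
  (n')^{B} \;\geq\; n^{B-C}
  \quad\Longrightarrow\quad
  n' \;\geq\; n^{1-C/B}.
\]
% If B > 2C/\varepsilon then C/B < \varepsilon/2, and since n is unbounded,
\[
  \frac{n^{1-\varepsilon/2}}{n'} \;\leq\; n^{C/B-\varepsilon/2} \;=\; o(1).
\]
```

The point of the sketch is that $C$ comes from (5.1) alone and so does not depend on $B$, which is why $B$ can be chosen afterwards.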

Let $n_0 := \lfloor n'/100 \rfloor$; then $n_0$ is an unbounded natural number with

$$\|\mu^{*100 n_0}\|_{\ell^2} \gg \|\mu^{*n_0}\|_{\ell^2}.$$

(We allow implied constants to depend on $B$ unless otherwise specified.)

Applying the form of the Balog-Szemerédi-Gowers theorem from [5, Proposition A.1], together with Proposition 3.1, we thus have

$$\sup_{x \in G} \mu^{*10 n_0}(xHP) \gg 1 \tag{5.6}$$

for some ultra coset nilprogression $HP$ in normal form with

$$|HP| \ll \|\mu^{*n_0}\|_{\ell^2}^{-2}. \tag{5.7}$$

Furthermore, the nilprogression $HP$ is infinitely proper in the sense of Proposition 3.1.


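By the pigeonhole principle, we may find $n_1$ in $[n_0, 2n_0]$ such that

$$\|\mu^{*(n_1+1)}\|_{\ell^2} = \left(1 - O\left(\frac{1}{n_0}\right)\right) \|\mu^{*n_1}\|_{\ell^2} \tag{5.8}$$

and

$$|HP| \ll 1/\|\mu^{*n_1}\|_{\ell^2}^2. \tag{5.9}$$

From construction we have

$$n^{1-\varepsilon/2} = o(n_1). \tag{5.10}$$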
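The pigeonhole step can be illustrated numerically. The following is a minimal sketch, assuming a hypothetical positive non-increasing sequence `a[k]` standing in for the norms $\|\mu^{*k}\|_{\ell^2}$ (the model sequence $k^{-3/2}$ below is an illustrative choice, not data from the paper). If the total decay $c = a_{2n_0}/a_{n_0}$ is bounded below, then the product of the $n_0$ consecutive ratios equals $c$, so the best ratio is at least $c^{1/n_0} \geq 1 - \log(1/c)/n_0$, which is the shape of (5.8).

```python
import math

def find_slow_step(a, n0):
    """Return n1 in [n0, 2*n0) maximising the one-step ratio a[n1 + 1] / a[n1].

    `a` is a list indexed by k; it is assumed positive and non-increasing.
    """
    return max(range(n0, 2 * n0), key=lambda k: a[k + 1] / a[k])

def pigeonhole_bounds(a, n0):
    """Compare the best one-step ratio with the pigeonhole lower bounds."""
    n1 = find_slow_step(a, n0)
    c = a[2 * n0] / a[n0]                     # total decay across [n0, 2*n0]
    best_ratio = a[n1 + 1] / a[n1]
    geometric = c ** (1.0 / n0)               # geometric mean of the n0 ratios
    linear = 1.0 - math.log(1.0 / c) / n0     # since exp(-x) >= 1 - x
    return n1, best_ratio, geometric, linear

def example_sequence(n0):
    """Hypothetical model sequence a[k] = k^(-3/2) for k >= 1 (a[0] is a placeholder)."""
    return [1.0] + [k ** -1.5 for k in range(1, 2 * n0 + 2)]

if __name__ == "__main__":
    print(pigeonhole_bounds(example_sequence(50), 50))
```

In the text the lower bound on the total decay comes from $\|\mu^{*100 n_0}\|_{\ell^2} \gg \|\mu^{*n_0}\|_{\ell^2}$ together with the monotonicity of the norms.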


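Squaring (5.8), we see that

$$\int_G \int_G \langle \delta_g * \mu^{*n_1}, \delta_h * \mu^{*n_1} \rangle\, d\mu(g)\, d\mu(h) = \left(1 - O\left(\frac{1}{n_0}\right)\right) \|\mu^{*n_1}\|_{\ell^2}^2$$

and hence by the cosine rule

$$\int_G \int_G \|\delta_g * \mu^{*n_1} - \delta_h * \mu^{*n_1}\|_{\ell^2}^2\, d\mu(g)\, d\mu(h) \ll \frac{1}{n_0} \|\mu^{*n_1}\|_{\ell^2}^2. \tag{5.11}$$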
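The use of the cosine rule here can be written out as follows. This is a sketch under the abbreviation $f := \mu^{*n_1}$, which is introduced only for this computation; it uses that left translation preserves the $\ell^2$ norm and that $\mu * f = \int_G \delta_g * f\, d\mu(g)$.

```latex
% Left translation is an isometry of \ell^2, so
% \|\delta_g * f\|_{\ell^2} = \|\delta_h * f\|_{\ell^2} = \|f\|_{\ell^2}, and
\[
  \|\delta_g * f - \delta_h * f\|_{\ell^2}^2
  = 2\|f\|_{\ell^2}^2 - 2\langle \delta_g * f, \delta_h * f\rangle .
\]
% Integrating in g, h against the probability measure \mu, and inserting
% \int\!\!\int \langle \delta_g * f, \delta_h * f\rangle\, d\mu\, d\mu
%   = \|\mu * f\|_{\ell^2}^2 = (1 - O(1/n_0)) \|f\|_{\ell^2}^2 :
\[
  \int_G\!\int_G \|\delta_g * f - \delta_h * f\|_{\ell^2}^2 \, d\mu(g)\,d\mu(h)
  = 2\|f\|_{\ell^2}^2 - 2\Bigl(1 - O\bigl(\tfrac{1}{n_0}\bigr)\Bigr)\|f\|_{\ell^2}^2
  = O\bigl(\tfrac{1}{n_0}\bigr)\|f\|_{\ell^2}^2 .
\]
```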


It will be convenient to manipulate this expression a bit. Taking square roots, and then using the triangle inequality in $h$, we see that

$$\left(\int_G \|\delta_g * \mu^{*n_1} - \mu^{*(n_1+1)}\|_{\ell^2}^2\, d\mu(g)\right)^{1/2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}. \tag{5.12}$$
By Young’s inequality we also have

$$\left(\int_G \|\delta_g * \mu^{*(n_1+1)} - \mu^{*(n_1+2)}\|_{\ell^2}^2\, d\mu(g)\right)^{1/2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}. \tag{5.13}$$

Also, from (5.11) we have

$$\left(\int_G \int_G \|\delta_{h^{-1}} * \delta_g * \mu^{*n_1} - \mu^{*n_1}\|_{\ell^2}^2\, d\mu(g)\, d\mu(h)\right)^{1/2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}$$

and thus by the triangle inequality in $g, h$ and the symmetry of $\mu$

$$\|\mu^{*(n_1+2)} - \mu^{*n_1}\|_{\ell^2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}$$

which when combined back with (5.13) and the triangle inequality gives

$$\left(\int_G \|\delta_g * \mu^{*(n_1+1)} - \mu^{*n_1}\|_{\ell^2}^2\, d\mu(g)\right)^{1/2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}$$

and thus, on combination with (5.12), we have

$$\left(\int_G \|\delta_g * \nu - \nu\|_{\ell^2}^2\, d\mu(g)\right)^{1/2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}$$

where $\nu := (\mu^{*n_1} + \mu^{*(n_1+1)})/2$; as $\nu$ and $\mu$ are symmetric, we may reverse this as

$$\left(\int_G \|\nu * \delta_g - \nu\|_{\ell^2}^2\, d\mu(g)\right)^{1/2} \ll \frac{1}{n_0^{1/2}} \|\mu^{*n_1}\|_{\ell^2}.$$


Thus, if we introduce the right-invariant semi-metric

$$d(g,h) := \|\nu * \delta_g - \nu * \delta_h\|_{\ell^2} / \|\mu^{*n_1}\|_{\ell^2}$$

then we have

$$\int_G d(g,1)^2\, d\mu(g) \ll \frac{1}{n_0}. \tag{5.14}$$

To use this, we exploit the following structural property of a small ball in the $d$ metric:

Lemma 5.2. There exists a standard $\varepsilon > 0$ such that the set $\{g \in G : d(g,1) \leq \varepsilon\}$ is covered by $O(1)$ left translates of $HP^2$. In particular, it is a nonstandard finite set.


Proof. Let $N$ be a standard natural number, and suppose that there exist $g_1,\dots,g_N$ in $\{g \in G : d(g,1) \leq \varepsilon\}$ with $g_1 HP, \dots, g_N HP$ disjoint. We will show that $N$ is bounded by a standard constant $C = O(1)$ independent of $N$, which then gives the claim from a greedy argument.

Since $d(g_i, 1) \leq \varepsilon$, we have

$$\|\nu * \delta_{g_i} - \nu\|_{\ell^2} \leq \varepsilon \|\mu^{*n_1}\|_{\ell^2},$$

but from (5.6) we have

$$\nu(y_0 HP) \gg 1$$
















for some $y_0 \in G$, and so by Cauchy-Schwarz

$$\|\nu\|_{\ell^2(y_0 HP)} \gg |HP|^{-1/2}.$$

But from Young’s inequality we have

$$\|\nu\|_{\ell^2} \leq \|\mu^{*n_1}\|_{\ell^2}$$

and the claim follows. □


We can then cover $\{g \in G : d(g,1) \leq \varepsilon\}$ by $O(1)$ left cosets of $\langle HP \rangle$. By the principle of infinite descent, taking $\varepsilon$ sufficiently small, we may assume that $\{g \in G : d(g,1) < \varepsilon\}$ and $\{g \in G : d(g,1) < \varepsilon'\}$ meet exactly the same set $\mathcal{X} \subset G/\langle HP \rangle$ of left cosets of $\langle HP \rangle$ for any standard $\varepsilon' > 0$.

We can put a (nonstandard) quotient metric $d_{\mathcal{X}}$ on $\mathcal{X}$ by declaring

$$d_{\mathcal{X}}(x\langle HP \rangle, y\langle HP \rangle) := \inf\{d(g,1) : g \in G;\ d(g,1) \leq \varepsilon;\ gx\langle HP \rangle = y\langle HP \rangle\},$$

noting that the set in the infimum is always a nonstandard finite set. By the construction of $\varepsilon$, we see that all distances in $\mathcal{X}$ are $o(1)$. Repeating the proof of Lemma 1X21 we may express $\mathcal{X}$ as $\{x\langle HP \rangle : x \in X\}$ for some finite set $X$ containing $1$ of bounded cardinality, such that the cosets $x\langle HP \rangle$ for $x \in X$ are all distinct, and

$$d(x,x') \sim d_{\mathcal{X}}(x\langle HP \rangle, x'\langle HP \rangle)$$

for all $x, x' \in X$. In particular

$$X \subset \{g \in G : d(g,1) = o(1)\} \subset \{g \in G : d(g,1) < \varepsilon\} \subset XHP^C \tag{5.15}$$

for some standard $C$, and that

$$d(g,1) \gg d(x,x') \tag{5.16}$$


whenever $x, x' \in X$ and $g \in G$ are such that $gx \in x'\langle HP \rangle$. By right invariance, $d(g,1) = d(gx,x)$, and thus

$$d(gx, x') \ll d(g,1).$$

We can write $gx = x'h$ for some $h \in HP^C$, thus

$$d(x', x'h) \ll d(g,1)$$

and thus by right-invariance and the triangle inequality

$$d(x', x'h^n) \ll n\, d(g,1)$$

for any nonstandard $n$. In particular, by (5.15), $h^n \in HP^C$ whenever $n\, d(g,1) \leq \varepsilon'$ for some standard $\varepsilon' > 0$. By the properness of $HP$ as in Section 3, we conclude that

$$\|h\|_{HP} \ll d(g,1)$$

and thus

$$\|g\|_{HP,X} \ll d(g,1).$$

This claim is clearly also true when $d(g,1) > \varepsilon$. We conclude from (5.14) that

$$\int_G \|g\|_{HP,X}^2\, d\mu(g) \ll \frac{1}{n_0}. \tag{5.17}$$


Now we control the large dimensions of $HP$. From Markov’s inequality and (5.17) we have $\|g\|_{HP,X} = O(1/n_0^{1/2})$ for a nonstandard set of $\mu$-measure $\gg 1$. In particular, we see that $g \in XHP^{O(1/n_0^{1/2})}$ for such a set. From the Cauchy-Schwarz inequality, we conclude that

$$1 \ll |XHP^{O(1/n_0^{1/2})}|^{1/2} \|\mu\|_{\ell^2},$$

and thus

$$|HP^{O(1/n_0^{1/2})}| \gg \|\mu\|_{\ell^2}^{-2}.$$

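Meanwhile, from (5.7), (5.5), we have

$$\begin{aligned} |HP| &\ll \|\mu^{*n_0}\|_{\ell^2}^{-2} \\ &\ll \|\mu^{*n'}\|_{\ell^2}^{-2} \\ &\ll (n'/n)^B \|\mu^{*n}\|_{\ell^2}^{-2} \\ &\ll (n'/n)^B n^{O(1)} \|\mu\|_{\ell^2}^{-2} \\ &\ll (n')^{O(1)} \|\mu\|_{\ell^2}^{-2} \\ &\ll n_0^{O(1)} \|\mu\|_{\ell^2}^{-2} \end{aligned}$$

if $B$ is large enough, where the implied constants in the $O(1)$ exponents do not depend on $B$. Thus we have

$$|HP^{O(1/n_0^{1/2})}| \gg n_0^{-O(1)} |HP|.$$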

If we let $L_1,\dots,L_r$ be the dimensions of $HP$, we thus have

$$\prod_{i=1}^r \left(1 + \frac{L_i}{n_0^{1/2}}\right) \gg n_0^{-O(1)} \prod_{i=1}^r (1 + L_i)$$

and thus $L_i \gg n_0^{\varepsilon/8}$ for at most $O(1)$ values of $i$, where $O(1)$ can depend on $\varepsilon$ but does not depend on $B$.
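The counting of large dimensions can be made explicit. The following is a sketch, assuming $0 < \varepsilon \leq 4$ (so that $n_0^{\varepsilon/8} \leq n_0^{1/2}$) and writing $C$ for the standard constant in the $n_0^{-O(1)}$ factor and $k$ for the number of large dimensions; the names $C$ and $k$ are introduced only for this illustration.

```latex
% Every factor of the ratio below is at least 1.  If L_i \geq n_0^{\varepsilon/8}, then
\[
  \frac{1+L_i}{1+L_i n_0^{-1/2}}
  \;\geq\; \tfrac12 \min\bigl(1+L_i,\; n_0^{1/2}\bigr)
  \;\geq\; \tfrac12\, n_0^{\varepsilon/8},
\]
% (if L_i \leq n_0^{1/2} the denominator is at most 2; otherwise it is at most
%  2 L_i n_0^{-1/2}).  Hence, if k of the L_i satisfy L_i \geq n_0^{\varepsilon/8},
\[
  \bigl(\tfrac12\, n_0^{\varepsilon/8}\bigr)^{k}
  \;\leq\; \prod_{i=1}^r \frac{1+L_i}{1+L_i n_0^{-1/2}}
  \;\ll\; n_0^{C},
\]
% and since n_0 is unbounded this forces k \leq 8C/\varepsilon + o(1), i.e. k = O(1).
```

Since $C$ does not depend on $B$, neither does the bound on $k$.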

From (5.17) and Markov’s inequality we know that $\|g\|_{HP,X} < n_0^{-\varepsilon/4}$ for a nonstandard set of $\mu$-measure $1 - O(n_0^{-1+\varepsilon/2}) = 1 - O(n^{-1+\varepsilon})$; we denote the complement of this set as $E$. Let $\widetilde{HP}$ be the coset nilprogression $HP$ with all generators $u_i$ with $N_i = o(n_0^{\varepsilon/4})$ removed; one easily checks that $\widetilde{HP}$ is still a coset nilprogression in normal form, and from the previous discussion $\widetilde{HP}$ has rank $O(1)$ independently of $B$. For $g \notin E$, we have $\|g\|_{HP,X} < n_0^{-\varepsilon/4}$ and hence $\|g\|_{\widetilde{HP},X} < n_0^{-\varepsilon/4}$ also; from (5.17) we have

$$\int_{G \setminus E} \|g\|_{\widetilde{HP},X}^2\, d\mu(g) \ll \frac{1}{n_0}. \tag{5.18}$$

If $n/n_0$ is bounded then we are now done (using $\widetilde{HP}$ in place of $HP$). Now suppose $n/n_0$ is unbounded. Let $m$ be the (nonstandard) integer part of $(n/n_0)^{1/2}$, and set $A := \widetilde{HP}^m$. Using the normal form of $\widetilde{HP}$, one can verify that $\widetilde{HP}^{m_1}$ is covered by a bounded number of translates of $\widetilde{HP}^{m_2}$ whenever $m_1 \ll m_2$. In particular, $A$ is an ultra approximate group. We can give $\langle A \rangle$ the structure of a locally compact (but not Hausdorff) group by declaring the neighbourhoods of the identity to be those sets that contain $\widetilde{HP}^{m'}$ for some $m \ll m' \ll m$; this gives a locally compact Hausdorff model for $\langle A \rangle$ after quotienting out by the closure of the identity (i.e. the intersection of all the $\widetilde{HP}^{m'}$ for $m \ll m' \ll m$). Applying [6, Theorem 4.2] and Proposition 3.1, we see that $A$ is commensurate to an $\mathbb{N}$-proper ultra coset nilprogression $HP'$ in normal form. Furthermore, if one goes through the proof of [6, Theorem 4.2] starting with the locally compact Hausdorff model provided above, we see that $HP'$ contains an open neighbourhood of the identity in $\langle A \rangle$ (basically because the local Lie models used in the proof are local quotients of the original locally compact model). Thus we have

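$$\widetilde{HP}^{m_1} \subset HP' \subset \widetilde{HP}^{m_2}$$

for some $m \ll m_1 \ll m_2 \ll m$. On the one hand, this implies that

$$\|g\|_{HP',X} \leq \frac{1}{m_1} \|g\|_{\widetilde{HP},X} \ll \left(\frac{n_0}{n}\right)^{1/2} \|g\|_{\widetilde{HP},X}$$

whenever $\|g\|_{\widetilde{HP},X} < 1$. It also implies that

$$\begin{aligned} |HP'| &\leq |\widetilde{HP}^{m_2}| \\ &\ll m^{O(1)} |\widetilde{HP}| \\ &\ll (n/n_0)^{O(1)} |HP| \end{aligned}$$

where the $O(1)$ exponent does not depend on $B$ due to the bounded rank of $\widetilde{HP}$. For $B$ large enough, (5.7), (5.5) then gives

$$|HP'| \ll \|\mu^{*n}\|_{\ell^2}^{-2}$$

and the claim follows (using $HP'$ in place of $HP$).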


6. Direct theorem for polynomial growth of measures 


We now prove Theorem 1.13. Again, it suffices to establish a nonstandard variant:


Theorem 6.1 (Forward Littlewood-Offord theorem). Let $G$ be a nonstandard group with nonstandard discrete symmetric probability measure $\mu$. Let $n$ be an unbounded natural number. Suppose that there exists a nonstandard coset progression $HP$ in normal form and a non-empty set $X$ of bounded cardinality such that

$$\int_{g \in G} \|g\|_{HP,X}^2\, d\mu(g) \ll \frac{1}{n}. \tag{6.1}$$

Then one has

$$\|\mu^{*n}\|_{\ell^2}^{-2} \ll |HP|.$$


The derivation of Theorem 1.13 from Theorem 6.1 proceeds as in previous sections and is omitted.
Let $\varepsilon > 0$ be a small standard quantity to be chosen later. If we let $\mu'$ be $\mu$ conditioned to the event $\|g\|_{HP,X} < \varepsilon$, then from (6.1) $\mu'$ is a nonstandard symmetric discrete probability measure with

$$\mu \geq \left(1 - O_\varepsilon\left(\frac{1}{n}\right)\right) \mu'$$

and thus

$$\mu^{*n} \gg (\mu')^{*n}$$

with implied constant depending on $\varepsilon$. Thus we may assume without loss of generality that $\|g\|_{HP,X} < \varepsilon$ on the support of $\mu$, without significantly affecting (6.1).
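The effect of the conditioning can be traced through in a few lines. The following is a sketch, writing $S$ for the event $\{g : \|g\|_{HP,X} < \varepsilon\}$ and $C_\varepsilon$ for a standard constant depending on $\varepsilon$; both names are introduced only for this illustration.

```latex
% By Markov's inequality and (6.1),
\[
  \mu(G \setminus S) \;\leq\; \varepsilon^{-2}\int_G \|g\|_{HP,X}^2\,d\mu(g)
  \;\leq\; \frac{C_\varepsilon}{n}.
\]
% Since \mu' = \mu 1_S / \mu(S), we have \mu \geq \mu(S)\,\mu' pointwise, hence
\[
  \mu^{*n} \;\geq\; \mu(S)^n\,(\mu')^{*n}
  \;\geq\; \Bigl(1-\frac{C_\varepsilon}{n}\Bigr)^{n} (\mu')^{*n}
  \;\gg_\varepsilon\; (\mu')^{*n},
\]
% because (1 - C_\varepsilon/n)^n is comparable to e^{-C_\varepsilon} when n is unbounded.
```

In particular a lower bound on $\|(\mu')^{*n}\|_{\ell^2}$ transfers to $\|\mu^{*n}\|_{\ell^2}$ with a loss depending only on $\varepsilon$.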

Let $P = P(u_1,\dots,u_r; N_1,\dots,N_r)$ denote the nilprogression $P := HP/H$. Let $\widetilde{HP}$ be the ultra coset nilprogression formed from $HP$ by removing those generators $u_i$ for which the associated dimension $N_i$ is bounded; as $HP$ is in normal form, it is not difficult to see that $\widetilde{HP}$ is also an ultra coset nilprogression in normal form. For $\varepsilon > 0$ a small enough standard quantity, one has $\widetilde{HP}^t = HP^t$ for all $t < \varepsilon$, and so $\|g\|_{\widetilde{HP},X} = \|g\|_{HP,X}$ on the support of $\mu$. Thus, by replacing $HP$ by $\widetilde{HP}$, we may assume without loss of generality that all dimensions $N_i$ are unbounded.


The next step is to represent the coset nilprogression $HP$ locally by a nonstandard Lie group. By repeated use of the upper triangular property (i), we see that for any nonstandard integers $a_1,\dots,a_r,b_1,\dots,b_r$, we have a multiplication law

$$u_1^{a_1} \dots u_r^{a_r} \cdot u_1^{b_1} \dots u_r^{b_r} = u_1^{P_1(a,b)} \dots u_r^{P_r(a,b)}$$

where $a := (a_1,\dots,a_r)$, $b := (b_1,\dots,b_r)$, and $P_1,\dots,P_r : {}^*\mathbb{Z}^r \times {}^*\mathbb{Z}^r \to {}^*\mathbb{Z}$ are polynomials of bounded degree; furthermore, if $a_i, b_i = O(N_i)$ for all $i = 1,\dots,r$, then $P_i(a,b) = O(N_i)$ for all $i = 1,\dots,r$.


We may uniquely extend the polynomials $P_1,\dots,P_r$ to be polynomials $P_1,\dots,P_r : {}^*\mathbb{R}^r \times {}^*\mathbb{R}^r \to {}^*\mathbb{R}$ of bounded degree; by interpolation (and the unbounded nature of $N_i$) it remains the case that if $a_1,\dots,a_r,b_1,\dots,b_r$ are nonstandard reals with $a_i, b_i = O(N_i)$ for all $i = 1,\dots,r$ then $P_i(a,b) = O(N_i)$ for all $i = 1,\dots,r$. Since $P_i(0,0) = 0$, this implies by further interpolation that if $a_i, b_i = O(\varepsilon N_i)$ for some $0 < \varepsilon < 1$, then $P_i(a,b) = O(\varepsilon N_i)$. From the local properness of $HP$, we conclude the associativity property

$$P(P(a,b),c) = P(a,P(b,c)) \tag{6.2}$$

whenever $a_i, b_i, c_i = O(\varepsilon N_i)$ for $i = 1,\dots,r$ and a sufficiently small standard $\varepsilon > 0$, where $P : {}^*\mathbb{R}^r \times {}^*\mathbb{R}^r \to {}^*\mathbb{R}^r$ is the polynomial map $P := (P_1,\dots,P_r)$. Since the $N_i$ are unbounded, this implies from interpolation that the associativity law (6.2) is in fact valid for all $a,b,c \in {}^*\mathbb{R}^r$. We can thus create a nonstandard Lie group $L$ by setting $L$ to equal the nonstandard vector space ${}^*\mathbb{R}^r$ with multiplication law given by $P$. From the upper triangular property (i) it is easy to see that this multiplication law is nilpotent, and so $L$ is a nonstandard simply connected Lie group. In particular, we have a bijective exponential map $\exp : \mathfrak{l} \to L$ from the nonstandard Lie algebra $\mathfrak{l}$ (which as a nonstandard vector space is simply ${}^*\mathbb{R}^r$) to $L$, inverted by a logarithm map $\log : L \to \mathfrak{l}$. We also have a local representation map $\phi : HP^{C\varepsilon} \to L$ defined (for some suitable large standard $C$ independent of $\varepsilon$) by

$$\phi(x) := (a_1,\dots,a_r)$$

whenever $x \in HP^{C\varepsilon}$ and $x \bmod H = u_1^{a_1} \dots u_r^{a_r}$ with $|a_i| \ll \varepsilon N_i$ (with implied constant independent of $\varepsilon$); this is well-defined by local properness if $\varepsilon$ is sufficiently small depending on $C$, and from construction of $L$ we have the local homomorphism property

$$\phi(xy) = \phi(x)\phi(y)$$

for $x, y \in HP^{C\varepsilon/2}$; also, $H$ lies in the kernel of $\phi$. Finally, from construction we see that

$$\|x\|_{HP} \sim \|\log \phi(x)\|_{\mathfrak{l}} \tag{6.3}$$

for $x \in HP^{C\varepsilon}$, where $\|\cdot\|_{\mathfrak{l}}$ is the usual (nonstandard) Euclidean norm on $\mathfrak{l} = {}^*\mathbb{R}^r$.
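The idea of a polynomial multiplication law that extends from integer exponents to real coordinates can be illustrated on the simplest nilpotent example. The following is a minimal sketch, assuming the Heisenberg group in matrix coordinates $(x,y,z)$ as a stand-in; it is a hypothetical illustration and not the law $P$ constructed in the text. The law below is a polynomial map of degree 2 with $P(0,0) = 0$, and the associativity identity (the analogue of (6.2)) is a polynomial identity, so once it holds on integer points it holds for all real (here rational) inputs.

```python
from fractions import Fraction
from itertools import product

def P(a, b):
    """Group law of the Heisenberg group in matrix coordinates (x, y, z),
    i.e. the product of upper unitriangular 3x3 matrices:
    (x, y, z) * (x', y', z') = (x + x', y + y', z + z' + x * y')."""
    return (a[0] + b[0], a[1] + b[1], a[2] + b[2] + a[0] * b[1])

def is_associative_on(points):
    """Check P(P(a, b), c) == P(a, P(b, c)) for all triples drawn from `points`."""
    return all(P(P(a, b), c) == P(a, P(b, c))
               for a, b, c in product(points, repeat=3))

# Integer lattice points in a small box, and a few rational points.
INTEGER_BOX = list(product(range(-1, 2), repeat=3))
RATIONALS = [(Fraction(1, 2), Fraction(-3, 7), Fraction(5, 3)),
             (Fraction(-2, 5), Fraction(1, 9), Fraction(0)),
             (Fraction(7, 4), Fraction(2, 3), Fraction(-1, 6))]

if __name__ == "__main__":
    print(is_associative_on(INTEGER_BOX), is_associative_on(RATIONALS))
```

Exact rational arithmetic is used so that the comparison is an equality test rather than a floating-point approximation.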


Let $g$ be chosen at random using the (nonstandard) probability measure $\mu$. By hypothesis, we can then associate a random permutation $\sigma : X \to X$ and random elements $h_x \in HP$ for $x \in X$ such that

$$\|h_x\|_{HP} \leq \|g\|_{HP,X}$$

and

$$gx = \sigma(x) h_x$$

for all $x \in X$. If there are multiple choices for $\sigma$, we choose amongst them uniformly at random. The symmetry of $\mu$ then shows that the random tuples

$$(\sigma, (h_x)_{x \in X})$$

and

$$(\sigma^{-1}, (h_{\sigma^{-1}(x)}^{-1})_{x \in X})$$

have the same distribution. From (6.1), (6.3) we have

$$\mathbf{E} \|\log \phi(h_x)\|_{\mathfrak{l}}^2 \ll 1/n \tag{6.4}$$

for all $x \in X$.


For any $i, j \in X$, we let $p_{ij} \in {}^*[0,1]$ denote the (nonstandard) probability that $\sigma(i) = j$, and let $a_{ij} \in \mathfrak{l}$ denote the conditional expectation

$$a_{ij} = \mathbf{E}(\log(\phi(h_i)) \mid \sigma(i) = j) \tag{6.5}$$

(with the convention that $a_{ij} = 0$ if $p_{ij} = 0$). Observe from the symmetry property mentioned above that $(p_{ij})$ is a (nonstandard) symmetric stochastic matrix, in the sense that the $p_{ij}$ are non-negative nonstandard reals with $p_{ij} = p_{ji}$ and

$$\sum_{j \in X} p_{ij} = 1$$

for all $i \in X$. Also, we have the crude bound

$$a_{ij} \ll 1$$

for all $i,j$, and a further application of symmetry shows that $a_{ij} = -a_{ji}$. From (6.1) and Cauchy-Schwarz, we also have

$$a_{ij} \ll p_{ij}^{-1/2} n^{-1/2}$$

(this bound is vacuously true if $p_{ij} = 0$).
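The Cauchy-Schwarz step behind the bound on $a_{ij}$ can be written out as follows. This is a sketch for the case $p_{ij} > 0$, using only the definition (6.5) and the second moment bound (6.4).

```latex
% By the definition of conditional expectation,
\[
  p_{ij}\,a_{ij} \;=\; \mathbf{E}\bigl(\log\phi(h_i)\,1_{\sigma(i)=j}\bigr),
\]
% so Cauchy-Schwarz together with \mathbf{E}\|\log\phi(h_i)\|_{\mathfrak l}^2 \ll 1/n gives
\[
  p_{ij}\,\|a_{ij}\|_{\mathfrak l}
  \;\leq\; \bigl(\mathbf{E}\|\log\phi(h_i)\|_{\mathfrak l}^2\bigr)^{1/2}\,
           \mathbf{P}(\sigma(i)=j)^{1/2}
  \;\ll\; n^{-1/2}\,p_{ij}^{1/2},
\]
% and dividing by p_{ij} yields a_{ij} \ll p_{ij}^{-1/2} n^{-1/2}.
```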
The quantities $a_{ij}$ reflect a certain amount of “drift” in the random walk associated to $\mu$. It would be very convenient if these quantities vanished, or at least obeyed the cancellation condition

$$\sum_{j \in X} a_{ij} p_{ij} = 0$$

for all $i \in X$. Such a cancellation need not occur a priori. However it is possible to effectively obtain such a cancellation after a suitable “gauge change”, which essentially reflects (a continuous version of) the freedom to right-multiply elements of $X$ by small elements in $HP$ (which was already exploited in Sections 3, 5). To perform this gauge change, we will need the following linear algebra lemma.


Lemma 6.2. Let $P = (p_{ij})_{1 \leq i,j \leq d}$ be a (nonstandard) symmetric stochastic matrix of dimension $d = O(1)$, thus $p_{ij}$ are nonstandard non-negative reals with $p_{ij} = p_{ji}$ and $\sum_{j=1}^d p_{ij} = 1$ for all $i$. Let $(a_{ij})_{1 \leq i,j \leq d}$ be an anti-symmetric matrix such that $a_{ij} = O(\min(1, \delta p_{ij}^{-1/2}))$ for all $i,j$ and some $\delta > 0$. Then there exist $t_i = O(1)$ for $i = 1,\dots,d$ such that

$$\sum_{j=1}^d a_{ij} p_{ij} = t_i - \sum_{j=1}^d p_{ij} t_j \tag{6.6}$$

for all $i = 1,\dots,d$, and

$$\sqrt{p_{ij}}\, (t_i - t_j) = O(\delta) \tag{6.7}$$

for all $i, j$.


Proof. By a compactness argument we may assume that all the $p_{ij}$ are strictly positive, which means that $P$ has a simple eigenvalue at $1$ (with eigenvector consisting of the constant vector with all entries $1/\sqrt{d}$). Note that the vector $(\sum_{j=1}^d a_{ij} p_{ij})_{i=1}^d$ sums to zero and is thus orthogonal to the constant eigenvector. If we let $u_2,\dots,u_d$ be an orthonormal eigenbasis of the remaining eigenvalues $\lambda_2,\dots,\lambda_d$, then we have an explicit solution

$$t_i := \sum_{k=2}^d \frac{1}{1-\lambda_k} \left(\sum_{l=1}^d \sum_{j=1}^d a_{lj} p_{lj} u_{k,l}\right) u_{k,i} \tag{6.8}$$

to (6.6), where $u_{k,i}$ is the $i^{\mathrm{th}}$ coefficient of $u_k$. We claim the estimate

$$p_{lj} (u_{k,l} - u_{k,j}) = O(1 - \lambda_k) \tag{6.9}$$

for all $k,l,j$. Assuming this estimate, we see from the anti-symmetry of $a_{ij}$ and the bound $a_{ij} = O(1)$ that

$$\sum_{l=1}^d \sum_{j=1}^d a_{lj} p_{lj} u_{k,l} = O(1 - \lambda_k)$$

which together with the bounds $u_{k,i} = O(1)$ and (6.8) gives the bounds $t_i = O(1)$. Next, for any $i, i'$ we have

$$t_i - t_{i'} = \sum_{k=2}^d \frac{1}{1-\lambda_k} \left(\sum_{l=1}^d \sum_{j=1}^d a_{lj} p_{lj} u_{k,l}\right) (u_{k,i} - u_{k,i'})$$

thanks to (6.8). On the one hand we have the trivial bound $u_{k,i} - u_{k,i'} = O(1)$, while from (6.9) we have $u_{k,i} - u_{k,i'} = O((1-\lambda_k)/p_{ii'})$; we take the geometric mean to obtain

$$u_{k,i} - u_{k,i'} = O((1-\lambda_k)^{1/2}/p_{ii'}^{1/2}). \tag{6.10}$$

Similarly, from the anti-symmetry of $a_{ij}$ and the bound $a_{ij} = O(\delta/\sqrt{p_{ij}})$ we have

$$\sum_{l=1}^d \sum_{j=1}^d a_{lj} p_{lj} u_{k,l} = O\left(\delta \sum_{l=1}^d \sum_{j=1}^d \sqrt{p_{lj}}\, |u_{k,l} - u_{k,j}|\right),$$

and hence by (6.10)

$$\sum_{l=1}^d \sum_{j=1}^d a_{lj} p_{lj} u_{k,l} = O((1-\lambda_k)^{1/2} \delta).$$

Putting all this together, we obtain (6.7).

It remains to show (6.9). The eigenvector equation gives $(1 - P) u_k = O(1 - \lambda_k)$, thus

$$\sum_{j=1}^d p_{ij} (u_{k,i} - u_{k,j}) = O(1 - \lambda_k)$$

for all $i = 1,\dots,d$. If we sort the $u_{k,i}$ in decreasing order,

$$u_{k,i_{k,1}} \geq u_{k,i_{k,2}} \geq \dots \geq u_{k,i_{k,d}}$$

and apply the above estimate with $i$ equal in turn to $i_{k,1},\dots,i_{k,d}$ and exploit the symmetry of $p_{ij}$, we conclude that

$$p_{ij} (u_{k,i} - u_{k,j}) = O(1 - \lambda_k)$$

for all $k,i,j$, and the claim follows. □
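The gauge equation (6.6) can be solved numerically in a small case. The following is a minimal sketch, assuming a hypothetical $3 \times 3$ symmetric stochastic matrix with strictly positive entries and a hypothetical anti-symmetric matrix (both chosen for illustration only). Instead of diagonalising, it sums the Neumann series $t = \sum_{m \geq 0} P^m b$ with $b_i = \sum_j a_{ij} p_{ij}$; when all non-trivial eigenvalues have modulus less than $1$ this series agrees with the eigenbasis formula (6.8), because $b$ is orthogonal to the constant eigenvector.

```python
def drift_vector(p, a):
    """b_i = sum_j a[i][j] * p[i][j], the left-hand side of (6.6)."""
    d = len(p)
    return [sum(a[i][j] * p[i][j] for j in range(d)) for i in range(d)]

def solve_gauge(p, a, iterations=200):
    """Approximate t = sum_{m >= 0} P^m b, which satisfies t - P t = b
    when b sums to zero and P has strictly positive entries."""
    d = len(p)
    b = drift_vector(p, a)
    t = [0.0] * d
    term = b[:]                      # current term P^m b of the series
    for _ in range(iterations):
        t = [t[i] + term[i] for i in range(d)]
        term = [sum(p[i][j] * term[j] for j in range(d)) for i in range(d)]
    return b, t

def residual(p, b, t):
    """Largest violation of t_i - sum_j p_ij t_j = b_i."""
    d = len(p)
    return max(abs(t[i] - sum(p[i][j] * t[j] for j in range(d)) - b[i])
               for i in range(d))

# Hypothetical illustrative data: symmetric, rows sum to 1, entries positive.
EXAMPLE_P = [[0.5, 0.3, 0.2],
             [0.3, 0.4, 0.3],
             [0.2, 0.3, 0.5]]
# Hypothetical anti-symmetric "drift" matrix.
EXAMPLE_A = [[0.0, 1.0, -2.0],
             [-1.0, 0.0, 0.5],
             [2.0, -0.5, 0.0]]

if __name__ == "__main__":
    b, t = solve_gauge(EXAMPLE_P, EXAMPLE_A)
    print(t, residual(EXAMPLE_P, b, t))
```

The anti-symmetry of `a` together with the symmetry of `p` is what makes `b` sum to zero, which is exactly the orthogonality used in the proof.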


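We can iterate this lemma in the nilpotent Lie group $L$ to obtain

Corollary 6.3. There exists $t_i \in \mathfrak{l}$ for $i \in X$ with

$$t_i = O(1) \tag{6.11}$$

and

$$\sqrt{p_{ij}}\, (t_i - t_j) = O\left(\frac{1}{\sqrt{n}}\right) \tag{6.12}$$

for all $i,j \in X$, such that

$$\sum_{j \in X} p_{ij} \log(\exp(-t_j) \exp(a_{ij}) \exp(t_i)) = 0$$

for all $i \in X$.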


Proof. Let

$$\mathfrak{l} = \mathfrak{l}_1 \supset \mathfrak{l}_2 \supset \dots \supset \mathfrak{l}_{s+1} = \{0\}$$

be the lower central series for the nilpotent Lie algebra $\mathfrak{l}$. We will prove inductively that for any $1 \leq k \leq s+1$, there exist $t_i$ obeying the bounds (6.11), (6.12) such that

$$\sum_{j \in X} p_{ij} \log(\exp(-t_j) \exp(a_{ij}) \exp(t_i)) \in \mathfrak{l}_k \tag{6.13}$$

for all $i \in X$. Taking $k = s+1$ will then give the claim.

The case $k = 1$ is trivial (just set $t_i = 0$ for all $i$), so suppose inductively that the claim has been proven for some $1 \leq k \leq s$, and that we seek to prove the claim for $k+1$. With $t_i$ as in the induction hypothesis, let $\pi$ be a linear projection from $\mathfrak{l}$ to $\mathfrak{l}_k$ with bounded coefficients, and write

$$b_{ij} := \pi(\log(\exp(-t_j) \exp(a_{ij}) \exp(t_i))).$$

Then $b_{ij}$ is anti-symmetric. From the bounds on $a_{ij}$, $t_i$ and the Baker-Campbell-Hausdorff formula, one has

$$b_{ij} = O(1)$$

and

$$b_{ij} = O\left(p_{ij}^{-1/2} \frac{1}{\sqrt{n}}\right)$$

for all $i,j$. By Lemma 6.2 applied to the coefficients of $b_{ij}$, we can find $v_i \in \mathfrak{l}_k$ for $i \in X$ such that

$$v_i = O(1)$$

and

$$\sqrt{p_{ij}}\, (v_i - v_j) = O\left(\frac{1}{\sqrt{n}}\right)$$

for $i,j \in X$, and

$$\sum_{j=1}^d b_{ij} p_{ij} = v_i - \sum_{j=1}^d p_{ij} v_j$$

for all $i \in X$. If one then sets

$$t'_i := t_i + v_i$$

then the inductive claim follows from multiple applications of the Baker-Campbell-Hausdorff formula. □


Let $t_i$ be as in the above corollary. We now build a “smooth bump function” $\Psi : G \to {}^*\mathbb{R}$ by the formula

$$\Psi(x) := \sum_{s \in X : x \in sHP^{C\varepsilon}} \psi(\varepsilon^{-1} \log(\exp(-t_s) \phi(s^{-1}x)))$$

for some sufficiently large standard $C$, where $\psi : \mathfrak{l} \to {}^*\mathbb{R}$ is a non-negative smooth compactly supported function independent of $\varepsilon$ bounded away from zero near the origin, with all derivatives bounded.

The function $\Psi$ is supported in $XHP^{C\varepsilon}$, is bounded by $O(1)$, and is $\gg 1$ on a set of cardinality $\gg_\varepsilon |HP|$ (where the subscript denotes the fact that the implied constant can depend on $\varepsilon$), and so

$$\|\Psi\|_{\ell^2} \sim_\varepsilon |HP|^{1/2} \tag{6.14}$$

and

$$\|\Psi\|_{\ell^1} \sim_\varepsilon |HP|. \tag{6.15}$$

Now we claim that $\Psi$ is essentially stable under convolution by $\mu$:

Lemma 6.4. We have

$$\|\mu * \Psi - \Psi\|_{\ell^2} \ll_\varepsilon \frac{1}{n} |HP|^{1/2}.$$

Proof. Given that $\mu * \Psi - \Psi$ is supported on $XHP^{O(\varepsilon)}$, it will suffice to show that

$$\mu * \Psi(sh) - \Psi(sh) = O_\varepsilon(1/n)$$

for all $s \in X$ and $h \in HP^{O(\varepsilon)}$.

We have

$$\Psi(sh) = \psi(\varepsilon^{-1} \log(\exp(-t_s) \phi(h))).$$

Let $g$ be drawn at random using $\mu$, and let the random variables $\sigma$ and $h_s$ be chosen as above. Then we have

$$\begin{aligned} \mu * \Psi(sh) &= \mathbf{E} \Psi(gsh) \\ &= \mathbf{E} \Psi(\sigma(s) h_s h) \\ &= \mathbf{E} \psi(\varepsilon^{-1} \log(\exp(-t_{\sigma(s)}) \phi(h_s) \exp(t_s) \exp(-t_s) \phi(h))) \end{aligned}$$

and hence by Taylor expansion

$$\mu * \Psi(sh) - \Psi(sh) = \mathbf{E} \Lambda(\log(\exp(-t_{\sigma(s)}) \phi(h_s) \exp(t_s))) + O(\mathbf{E} \|\log(\exp(-t_{\sigma(s)}) \phi(h_s) \exp(t_s))\|^2)$$

where $\Lambda : \mathfrak{l} \to {}^*\mathbb{R}$ is the derivative of $x \mapsto \psi(\varepsilon^{-1} \log(\exp(x) \exp(-t_s) \phi(h)))$ at $x = 0$.

By (6.4), (6.12) the error term here is $O(\frac{1}{n})$, which is acceptable. As $\Lambda$ is linear with coefficients $O_\varepsilon(1)$, it suffices to show that

$$\mathbf{E} \log(\exp(-t_{\sigma(s)}) \phi(h_s) \exp(t_s)) = O_\varepsilon(1/n).$$

The left-hand side can be expanded as

$$\sum_{s' \in X} p_{ss'} \mathbf{E}(\log(\exp(-t_{s'}) \phi(h_s) \exp(t_s)) \mid \sigma(s) = s').$$

By the Baker-Campbell-Hausdorff formula, the expression $\log(\exp(-t_{s'}) \phi(h_s) \exp(t_s))$ is an affine function of $\log \phi(h_s)$, up to an error of $O_\varepsilon(\|\log \phi(h_s)\|^2)$. By (6.5), (6.4), we may thus write this expression as

$$\sum_{s' \in X} p_{ss'} \log(\exp(-t_{s'}) \exp(a_{ss'}) \exp(t_s)) + O_\varepsilon(1/n).$$

By Corollary 6.3 the sum vanishes, and the claim follows. □


Iterating this lemma using Young’s inequality and the triangle inequality, we see that

$$\|\mu^{*\lfloor \delta n \rfloor} * \Psi - \Psi\|_{\ell^2} \ll_\varepsilon \delta |HP|^{1/2}$$

for any $\delta > 0$, and hence that

$$\langle \mu^{*\lfloor \delta n \rfloor}, \Psi * \tilde\Psi \rangle \gg_\varepsilon \|\Psi\|_{\ell^2}^2$$

for some standard $\delta > 0$ depending on $\varepsilon$, where $\tilde\Psi(g) := \Psi(g^{-1})$. Since $\Psi * \tilde\Psi$ is supported on $XHP^{O(\varepsilon)}$ and has magnitude $O(|HP|)$, we conclude from (6.14) that

$$\mu^{*\lfloor \delta n \rfloor}(XHP) \gg_\varepsilon 1$$

and hence by the pigeonhole principle we have

$$\mu^{*\lfloor \delta n \rfloor}(sHP) \gg_\varepsilon 1$$

for some $s \in X$. Convolving $\mu^{*\lfloor \delta n \rfloor}$ with itself (and using symmetry and the identity $(sHP)^{-1} sHP = HP^2$) we conclude that

$$\mu^{*2\lfloor \delta n \rfloor}(HP^2) \gg_\varepsilon 1$$

and thus on further convolution one has

$$\mu^{*2m\lfloor \delta n \rfloor}(HP^{2m}) \gg_{\varepsilon,m} 1$$

for any standard natural number $m$. By Cauchy-Schwarz (and the fact that $HP$ is an ultra approximate group, which comes from the normal form and unbounded dimensions of $HP$), this implies that

$$\|\mu^{*2m\lfloor \delta n \rfloor}\|_{\ell^2}^{-2} \ll_{\varepsilon,m} |HP|$$

and hence by monotonicity (and choosing $m$ large enough depending on $\delta$) one has

$$\|\mu^{*n}\|_{\ell^2}^{-2} \ll_{\varepsilon,\delta} |HP|$$

which gives the required claim.
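The last two steps (Cauchy-Schwarz and monotonicity) can be written out as follows. This is a sketch, with the abbreviations $k := 2m\lfloor \delta n \rfloor$ and $S := HP^{2m}$ introduced only for this computation.

```latex
% By Cauchy-Schwarz,
\[
  1 \;\ll_{\varepsilon,m}\; \mu^{*k}(S) \;=\; \sum_{x \in S} \mu^{*k}(x)
  \;\leq\; |S|^{1/2}\,\|\mu^{*k}\|_{\ell^2},
\]
% and |S| = |HP^{2m}| \ll_m |HP| since HP is an ultra approximate group, so
\[
  \|\mu^{*k}\|_{\ell^2}^{-2} \;\ll_{\varepsilon,m}\; |HP| .
\]
% By Young's inequality \|\mu^{*j}\|_{\ell^2} is non-increasing in j; taking
% m \geq 1/\delta (so that k \geq n, as n is unbounded and m is standard) gives
\[
  \|\mu^{*n}\|_{\ell^2}^{-2} \;\leq\; \|\mu^{*k}\|_{\ell^2}^{-2}
  \;\ll_{\varepsilon,\delta}\; |HP| .
\]
```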


7. Abelian inverse Littlewood-Offord theory 


We now use Theorem 1.12 to reprove Theorem 1.3. The key input is the following Fourier-analytic fact.


Proposition 7.1. Let $G = (G,+)$ be an abelian group. Let $\mu_1,\dots,\mu_n$ be symmetric discrete probability measures, and define

$$\mu := \frac{1}{2}\delta_0 + \frac{1}{2n}(\mu_1 * \mu_1 + \dots + \mu_n * \mu_n);$$

thus $\mu$ is another symmetric probability measure. Then

$$\sup_{x \in G} \mu_1 * \dots * \mu_n(\{x\}) \leq \mu^{*n}(\{0\}) \leq \tilde\mu_1 * \dots * \tilde\mu_n(\{0\})$$

where

$$\tilde\mu_j := e^{-1/2}\delta_0 + (1 - e^{-1/2}) \mu_j * \mu_j.$$


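Proof. By a limiting argument we may assume that $\mu_1,\dots,\mu_n$ are finitely supported, and then by Freiman isomorphism we may take $G$ to be finite.

Standard Fourier analysis shows that

$$\sup_{x \in G} \mu_1 * \dots * \mu_n(\{x\})$$

is bounded by

$$\mathbf{E}_{\xi \in \hat G} \prod_{j=1}^n |\hat\mu_j(\xi)|.$$

By the arithmetic mean-geometric mean inequality, we may bound this by

$$\mathbf{E}_{\xi \in \hat G} \left(\frac{1}{n} \sum_{j=1}^n |\hat\mu_j(\xi)|\right)^n.$$

Bounding $|x| \leq \frac{1}{2} + \frac{1}{2}|x|^2$, this is bounded by

$$\mathbf{E}_{\xi \in \hat G} \left(\frac{1}{2} + \frac{1}{2n} \sum_{j=1}^n |\hat\mu_j(\xi)|^2\right)^n$$

which equals

$$\mathbf{E}_{\xi \in \hat G} \hat\mu(\xi)^n = \mu^{*n}(\{0\}).$$

Next, we bound $x \leq \exp(-(1-x))$ for $x \leq 1$ to bound this by

$$\mathbf{E}_{\xi \in \hat G} \exp\left(-\frac{1}{2} \sum_{j=1}^n (1 - |\hat\mu_j(\xi)|^2)\right)$$

which can be rearranged as

$$\mathbf{E}_{\xi \in \hat G} \prod_{j=1}^n \exp\left(-\frac{1}{2}(1 - |\hat\mu_j(\xi)|^2)\right).$$

We can bound $\exp(-\frac{1}{2}x) \leq 1 - (1 - e^{-1/2})x$ for $0 \leq x \leq 1$, so we may bound the above by

$$\mathbf{E}_{\xi \in \hat G} \prod_{j=1}^n \left(e^{-1/2} + (1 - e^{-1/2}) |\hat\mu_j(\xi)|^2\right)$$

which equals

$$\mathbf{E}_{\xi \in \hat G} \prod_{j=1}^n \hat{\tilde\mu}_j(\xi)$$

which equals $\tilde\mu_1 * \dots * \tilde\mu_n(\{0\})$. The claim follows. □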
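The two inequalities of Proposition 7.1 can be tested directly on a small finite abelian group. The following is a minimal sketch, assuming the cyclic group $\mathbb{Z}/N$ and Bernoulli measures on $\pm v$ as a hypothetical test case (the modulus and the steps are illustrative choices). To avoid a clash with the group size, the number of measures is called `k` in the code rather than $n$.

```python
import math

def convolve(f, g):
    """Convolution of two functions on Z/N, given as lists of length N."""
    n = len(f)
    return [sum(f[y] * g[(x - y) % n] for y in range(n)) for x in range(n)]

def delta0(n):
    """Dirac mass at the origin of Z/N."""
    return [1.0] + [0.0] * (n - 1)

def bernoulli(n, v):
    """Symmetric measure giving mass 1/2 to each of v and -v in Z/N."""
    m = [0.0] * n
    m[v % n] += 0.5
    m[(-v) % n] += 0.5
    return m

def conv_all(measures, n):
    """Convolution of a list of measures on Z/N."""
    out = delta0(n)
    for m in measures:
        out = convolve(out, m)
    return out

def proposition_7_1_quantities(mus, n):
    """Return (sup_x mu_1*...*mu_k(x), mu^{*k}(0), tilde mu_1*...*tilde mu_k(0))."""
    k = len(mus)
    d0 = delta0(n)
    squares = [convolve(m, m) for m in mus]
    # mu := (1/2) delta_0 + (1/(2k)) * sum_j mu_j * mu_j
    mu = [0.5 * d0[x] + sum(s[x] for s in squares) / (2 * k) for x in range(n)]
    c = math.exp(-0.5)
    # tilde mu_j := e^{-1/2} delta_0 + (1 - e^{-1/2}) mu_j * mu_j
    tilde = [[c * d0[x] + (1 - c) * s[x] for x in range(n)] for s in squares]
    lhs = max(conv_all(mus, n))
    mid = conv_all([mu] * k, n)[0]
    rhs = conv_all(tilde, n)[0]
    return lhs, mid, rhs

if __name__ == "__main__":
    N = 11
    steps = (1, 2, 3, 5)
    print(proposition_7_1_quantities([bernoulli(N, v) for v in steps], N))
```

The convolutions are computed directly rather than through the Fourier transform, so the check is independent of the Fourier-analytic argument used in the proof.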


Now we can prove Theorem 1.3. Let $G, n, v_1, \dots, v_n, p, \xi_1, \dots, \xi_n, A, n', \varepsilon$ be as in that 
theorem. We may assume that $n$ is sufficiently large depending on $\varepsilon, A$, since the claim 
is trivial otherwise. For $i = 1, \dots, n$, let $\mu_i$ be the Bernoulli probability distribution 
that takes the values $v_i, -v_i$ with equal probability; then 

\[ p = \sup_{x \in G} \mu_1 * \dots * \mu_n(\{x\}) \]

and thus by (7.1) one has 

\[ \mu^{*n}(\{0\}) \ge p \ge n^{-A} \]


and hence by Young's inequality 

\[ \|\mu^{*\lfloor n/2 \rfloor}\|_{\ell^2(G)}^2 \ge p \ge n^{-A}. \]
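
As a small sanity check of the bound $\mu^{*n}(\{0\}) \ge p$ in the Bernoulli setting, one can compute both sides exactly for a short list of integer steps. This is a hedged sketch only: it takes $G = \mathbf{Z}$, and the function names `bernoulli`, `convolve` and `concentration_and_bound` are hypothetical helpers introduced for illustration.

```python
from collections import defaultdict
from fractions import Fraction


def convolve(f, g):
    """Convolution of finitely supported measures on Z stored as dicts."""
    h = defaultdict(Fraction)
    for a, fa in f.items():
        for b, gb in g.items():
            h[a + b] += fa * gb
    return dict(h)


def bernoulli(v):
    """Measure giving mass 1/2 to each of v and -v (merged if v == 0)."""
    m = defaultdict(Fraction)
    m[v] += Fraction(1, 2)
    m[-v] += Fraction(1, 2)
    return dict(m)


def concentration_and_bound(vs):
    """Return (p, mu^{*n}({0})) for the steps vs, with mu as in Proposition 7.1."""
    n = len(vs)
    mus = [bernoulli(v) for v in vs]
    prod = {0: Fraction(1)}
    for m in mus:
        prod = convolve(prod, m)
    p = max(prod.values())
    # mu = (1/2) delta_0 + (1/2n) sum_i mu_i * mu_i
    mu = defaultdict(Fraction)
    mu[0] += Fraction(1, 2)
    for m in mus:
        for x, w in convolve(m, m).items():
            mu[x] += w / (2 * n)
    mu = dict(mu)
    mu_n = {0: Fraction(1)}
    for _ in range(n):
        mu_n = convolve(mu_n, mu)
    return p, mu_n.get(0, Fraction(0))


if __name__ == "__main__":
    p, bound = concentration_and_bound([1, 1, 1, 1])
    print(p, bound)
```

With all four steps equal to 1, the concentration probability $p$ is the central binomial probability, and the second returned value dominates it, as the proposition predicts.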


Applying Theorem 1.12 (with $\varepsilon$ replaced by $\varepsilon/2$, and using additive group notation), 
we can find a coset progression $H + P$ of rank at most $C_{A,\varepsilon}$, and a finite subset $X$ of $G$ 
containing the origin of cardinality at most $C_{A,\varepsilon}$, such that 

\[ |H + P| \le C_{A,\varepsilon}\, p^{-1} \]

and such that 

\[ \int_{G \setminus E} \|x\|_{H+P,X}^2 \, d\mu(x) \le \frac{C_{A,\varepsilon}}{n} \]

for some exceptional set E with 

MU < 

Here $C_{A,\varepsilon}$ is a quantity depending only on $A$ and $\varepsilon$. 

Since $G$ is assumed to be torsion-free, $H$ is trivial. By Chebyshev's inequality, the 
hypotheses on $n'$, and the definition of $\mu$, we then have 

\[ \|v_i\|_{P,X}^2 \le \frac{C'_{A,\varepsilon}}{n'} \]

for all but at most $n'$ of the $i \in \{1, \dots, n\}$ and some $C'_{A,\varepsilon}$ depending on $A, \varepsilon$. 

Let $i$ be as above, and set $m := \lfloor (n')^{1/2} \rfloor$; then from (1.1) there is a permutation 
$\sigma : X \to X$ such that 

\[ \|v_i + x - \sigma(x)\|_P \le \frac{C''_{A,\varepsilon}}{m} \]

for some $C''_{A,\varepsilon}$ depending on $A, \varepsilon$ and all $x \in X$, which in particular implies upon iteration 
and telescoping that 

\[ \{v_i, 2v_i, \dots, mv_i\} \subset X + C''_{A,\varepsilon} P. \]

By [2S, Theorem 1.10], we can place $X + C''_{A,\varepsilon} P$ inside an arithmetic progression 
\[ P' = \{n_1 w_1 + \dots + n_r w_r : |n_i| \le N_i \ \forall i = 1, \dots, r\} \]
of rank $r \le C'''_{A,\varepsilon}$ and size 

\[ |P'| \le C'''_{A,\varepsilon}\, p^{-1} \]

for some $C'''_{A,\varepsilon}$ depending on $A, \varepsilon$, which is $2$-proper in the sense that the $n_1 w_1 + \dots + n_r w_r$ 
with $|n_i| \le 2N_i$ are all distinct. Since 

\[ \{v_i, 2v_i, \dots, mv_i\} \subset P' \]

we see (by arguing by contradiction) that $v_i$ must lie in the progression 

\[ \{n_1 w_1 + \dots + n_r w_r : |n_i| \le N_i/m \ \forall i = 1, \dots, r\}. \]

If there are $r'$ choices of $N_i$ for which $N_i \ge m$, then we see that this progression has 
rank $r'$ and cardinality at most $C''''_{A,\varepsilon} |P'| / m^{r'}$ for some $C''''_{A,\varepsilon}$ depending on $A, \varepsilon$. Theorem 
1.3 follows. 
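
The last step (an element all of whose first $m$ multiples stay in a proper progression must lie in the progression shrunk by a factor of $m$) can be illustrated by brute force. The sketch below is an assumption-laden toy model: it identifies a proper progression of rank $r$ with the box of coefficient vectors $(n_1, \dots, n_r)$, $|n_i| \le N_i$, in $\mathbf{Z}^r$, and the helper names are hypothetical.

```python
from itertools import product


def in_progression(coeffs, N):
    """True if the coefficient vector lies in the box |n_i| <= N_i."""
    return all(abs(c) <= Ni for c, Ni in zip(coeffs, N))


def elements_with_multiples_inside(N, m):
    """All v in the box whose multiples v, 2v, ..., mv also lie in the box."""
    box = product(*[range(-Ni, Ni + 1) for Ni in N])
    return [
        v for v in box
        if all(in_progression(tuple(j * c for c in v), N)
               for j in range(1, m + 1))
    ]


if __name__ == "__main__":
    N, m = (12, 3), 4
    survivors = elements_with_multiples_inside(N, m)
    # Only the direction with N_i >= m survives, so the rank drops to r' = 1.
    print(survivors)
    print(len(survivors), (2 * 12 + 1) * (2 * 3 + 1) / m ** 1)
```

In this toy case only one of the two dimensions satisfies $N_i \ge m$, so the surviving elements form a progression of rank $r' = 1$, whose size can be compared with $|P'|/m^{r'}$.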


8. Littlewood-Offord type theorems 
In this section we prove Theorems 1.15 and 1.16. 

We first prove Theorem 1.15. Let $\mu$ be the uniform probability measure on $A_1, \dots, A_n, A_1^{-1}, \dots, A_n^{-1}$; 
then $\mu$ is symmetric and 

\[ \sup_{B \in G} \mathbf{P}(A'_1 \cdots A'_n = B) = \|\mu^{*n}\|_{\ell^\infty(G)}; \]

splitting $\mu^{*n} = \mu^{*n'} * \mu^{*n'} * \mu^{*i}$ with $n' = \lfloor n/2 \rfloor$ and $i \in \{0,1\}$ and using Young's inequality, 
we conclude from (1.9) that 

\[ \|\mu^{*n'}\|_{\ell^2(G)}^2 \ge \frac{1}{\varepsilon\sqrt{n}} \]

or equivalently 

\[ \|\mu^{*n'}\|_{\ell^2(G)}^{-2} \le \varepsilon\sqrt{n}. \]
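
The splitting step rests on the general inequality $\|\mu^{*n}\|_{\ell^\infty(G)} \le \|\mu^{*\lfloor n/2 \rfloor}\|_{\ell^2(G)}^2$ for a probability measure $\mu$, which follows from Young's inequality. Here is a minimal sketch that checks it exactly in a small nonabelian group. The choice of the symmetric group on three letters, the encoding of permutations as tuples, and all function names are assumptions made for illustration.

```python
from collections import defaultdict
from fractions import Fraction


def compose(p, q):
    """Product pq of permutations stored as tuples: (pq)(k) = p[q[k]]."""
    return tuple(p[k] for k in q)


def inverse(p):
    inv = [0] * len(p)
    for k, pk in enumerate(p):
        inv[pk] = k
    return tuple(inv)


def convolve(f, g):
    """Convolution of finitely supported measures on a permutation group."""
    h = defaultdict(Fraction)
    for a, fa in f.items():
        for b, gb in g.items():
            h[compose(a, b)] += fa * gb
    return dict(h)


def power(mu, k, identity):
    """The k-fold convolution power of mu."""
    out = {identity: Fraction(1)}
    for _ in range(k):
        out = convolve(out, mu)
    return out


def symmetrised_measure(elements):
    """Uniform measure on the multiset A_1, ..., A_n, A_1^{-1}, ..., A_n^{-1}."""
    mu = defaultdict(Fraction)
    w = Fraction(1, 2 * len(elements))
    for a in elements:
        mu[a] += w
        mu[inverse(a)] += w
    return dict(mu)


def sup_and_l2_squared(elements, n):
    """Return (sup norm of mu^{*n}, squared l^2 norm of mu^{*floor(n/2)})."""
    identity = tuple(range(len(elements[0])))
    mu = symmetrised_measure(elements)
    sup_norm = max(power(mu, n, identity).values())
    half = power(mu, n // 2, identity)
    return sup_norm, sum(w * w for w in half.values())


if __name__ == "__main__":
    gens = [(1, 0, 2), (0, 2, 1), (1, 2, 0)]  # two transpositions, one 3-cycle
    print(sup_and_l2_squared(gens, 5))
```

The first returned value is the concentration probability $\sup_B \mathbf{P}(A'_1 \cdots A'_n = B)$ for these generators, and the second is the quantity that bounds it from above.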

Applying Theorem 1.12 (with $d = 1$, $n$ replaced by $n'$, and $\varepsilon$ replaced by (say) $1/2$), we 
obtain a coset nilprogression $HP$ of rank and nilpotency class at most $C$ in $C$-normal 
form, and a finite subset $X$ of $G$ of cardinality at most $C$ containing the identity, such 
that 

\[ |HP| \le C\varepsilon\sqrt{n} \tag{8.1} \]

and such that 

\[ \int_{G \setminus E} \|x\|_{HP,X}^2 \, d\mu(x) \le \frac{C}{n} \tag{8.2} \]


for some exceptional set E with 


C 


H(E) < — < C 1/2 e 2 


n 


(8.1) 

( 8 . 2 ) 

(8.3) 


for some absolute constant $C$. From (8.2), Chebyshev's inequality, and (8.3), we see 
that we have 

\[ \|x\|_{HP,X} < \frac{1}{C\varepsilon\sqrt{n}} \]

for all $x$ outside of a set of $\mu$-measure at most $C^{1/2}\varepsilon^2 + C^3\varepsilon^2$, thus 

\[ \|A_i\|_{HP,X} < \frac{1}{C\varepsilon\sqrt{n}} \]

for all but at most $C'\varepsilon^2 n$ choices of $i = 1, \dots, n$, for some absolute constant $C'$. For 
each such $i$, we see from (1.1) that there exists a permutation $\sigma_i : X \to X$ such that 

\[ \|\sigma_i(x)^{-1} A_i x\|_{HP} < \frac{1}{C\varepsilon\sqrt{n}} \]

for all $x \in X$. Since $|P| \le |HP| \le C\varepsilon\sqrt{n}$, we conclude that 

\[ \sigma_i(x)^{-1} A_i x \in H \]

for all $x \in X$. Thus $A_i$ lives in the group $H' := \{g \in G : gXH = XH\}$, which is a group 
of order at most $|X||H| \le C^2\varepsilon\sqrt{n}$. The claim follows. 

Now we prove Theorem 1.16. By reducing $n$ by one if necessary (and adjusting $\varepsilon$ 
slightly), we may assume that $n$ is even; then by Young's inequality 


lA n/2 ||/ (G ) < n 


d-\~ 1 —£ 


and so by Theorem 1.12, as in the proof of Theorem 1.15, we can find a coset nilprogression 
$HP$ in $C_{d,\varepsilon}$-normal form of rank and nilpotency class at most $C_{d,\varepsilon}$ with 

\HP\ ^ C d , e n d+l ~ £ (8.4) 

and a set $X \subset G$ of cardinality at most $C_{d,\varepsilon}$ such that the set 

\[ F := \left\{ g \in G : \|g\|_{HP,X} \le \frac{C_{d,\varepsilon}}{\sqrt{n}} \right\} \]

has $\mu$-measure at least $1 - \varepsilon$, where $C_{d,\varepsilon}$ depends only on $d, \varepsilon$. 

Write $HP/H = P(v_1, \dots, v_r; N_1, \dots, N_r)$. We may delete any generator $v_i$ with $N_i < 
C_{d,\varepsilon}^{-1}\sqrt{n}$ (without significantly affecting the $C_{d,\varepsilon}$-normal form), since this does not alter 
$F$. If $g \in F$, then by (1.1) there is a permutation $\sigma : X \to X$ such that $\sigma(x)^{-1} g x \in \langle HP \rangle$ 
for all $x \in X$, so that $gX\langle HP \rangle = X\langle HP \rangle$. The group $\{g : gX\langle HP \rangle = X\langle HP \rangle\}$ contains 
$\langle HP \rangle$ as a finite index subgroup (since this is a stabiliser of the action of this group on 
the finite space $X\langle HP \rangle / \langle HP \rangle$), so it will suffice to show that $\langle HP \rangle$ has growth at most 
$d$. Quotienting by $H$, it suffices to show that $v_1, \dots, v_r$ generate a group of growth at 
most $d$. 


Let $K = \langle HP \rangle / H$ be the group generated by the $v_1, \dots, v_r$. This is a nilpotent group. 
Inductively using the upper-triangular property of nilprogressions in $C_{d,\varepsilon}$-normal form, 
we see that the $j$-th term $K_j$ in the lower central series of $K$ is generated by a subset of 
$\{v_i : N_i \ge C_{j,d,\varepsilon}^{-1} n^{j/2}\}$ for some $C_{j,d,\varepsilon}$ depending on $j, d, \varepsilon$. By the Bass-Guivarc'h formula, 
the order $D$ of growth of $K$ is then at most 

\[ \sum_{j \ge 1} \#\{v_i : N_i \ge C_{j,d,\varepsilon}^{-1} n^{j/2}\} \]


and hence 


n^icy-n n niri 

i=l 1 ^i^rJVpCr. 1 nH 2 

% J,d,£ 

> {C^r l n D 

for some $C'_{d,\varepsilon}, C''_{d,\varepsilon}$ depending on $d, \varepsilon$. On the other hand, from the volume bound for 
nilprogressions in normal form and (8.4) we have 


n n ‘ < cap*'-' 

i=l 


for some $C_{d,\varepsilon}$ depending on $d, \varepsilon$. For $n$ large enough, we thus have $D \le d$, and the claim 
follows. 
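
For orientation, the Bass-Guivarc'h formula expresses the order of growth of a finitely generated nilpotent group $K$ as $D = \sum_{j \ge 1} j \cdot \mathrm{rank}(K_j / K_{j+1})$, which by summation by parts equals $\sum_{j \ge 1} \sum_{k \ge j} \mathrm{rank}(K_k / K_{k+1})$; the second form is the one that is compared with a count of generators of each $K_j$. The sketch below merely evaluates both forms for a given list of ranks. The input format and the function names are assumptions made for this illustration.

```python
def growth_order(ranks):
    """Bass-Guivarc'h order of growth: sum over j of j * rank(K_j / K_{j+1}).

    `ranks[j - 1]` is the (torsion-free) rank of the j-th quotient of the
    lower central series.
    """
    return sum(j * r for j, r in enumerate(ranks, start=1))


def growth_order_by_tails(ranks):
    """Same quantity, written as a sum over j of the total rank of K_j."""
    return sum(sum(ranks[j:]) for j in range(len(ranks)))


if __name__ == "__main__":
    heisenberg = [2, 1]  # K/K_2 has rank 2, K_2 has rank 1
    print(growth_order(heisenberg), growth_order_by_tails(heisenberg))
    abelian_rank_3 = [3]  # Z^3
    print(growth_order(abelian_rank_3), growth_order_by_tails(abelian_rank_3))
```

For the discrete Heisenberg group this gives growth of order four, and for $\mathbf{Z}^3$ order three, matching the familiar values.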